METHODS AND COMPOSITIONS FOR ENHANCING
THE EFFICACY AND SPECIFICITY OF RNAi

Government Rights 

This invention was made at least in part with government support under grant
nos. R01 GM62862-01, GM65236-01, and R21 NS44952-01 awarded by the National
Institutes of Health. The government may have certain rights in this invention.

Related Applications

This patent application claims the benefit of U.S. Provisional Patent Application
Serial No. 60/475,331, entitled "Methods and Compositions for Enhancing the Efficacy
and Specificity of RNAi," filed June 2, 2003; U.S. Provisional Patent Application Serial
No. 60/507,928, entitled "Methods and Compositions for Enhancing the Efficacy and
Specificity of RNAi," filed September 30, 2003; and U.S. Provisional Patent Application
Serial No. 60/XXXXXX, entitled "Methods and Compositions for Enhancing the
Efficacy and Specificity of RNAi," filed May 28, 2004 and bearing attorney docket
number UMY-066-3. The entire contents of the above-referenced provisional patent
applications are incorporated herein by this reference.

Background of the Invention

Two types of ~21 nt RNAs trigger post-transcriptional gene silencing in animals:
small interfering RNAs (siRNAs) and microRNAs (miRNAs). Both siRNAs and
miRNAs are produced by the cleavage of double-stranded RNA (dsRNA) precursors by
Dicer, a member of the RNase III family of dsRNA-specific endonucleases (Bernstein et
al., 2001; Billy et al., 2001; Grishok et al., 2001; Hutvágner et al., 2001; Ketting et al.,
2001; Knight and Bass, 2001; Paddison et al., 2002; Park et al., 2002; Provost et al.,
2002; Reinhart et al., 2002; Zhang et al., 2002; Doi et al., 2003; Myers et al., 2003).
siRNAs result when transposons, viruses or endogenous genes express long dsRNA or
when dsRNA is introduced experimentally into plant or animal cells to trigger gene
silencing, a process known as RNA interference (RNAi) (Fire et al., 1998; Hamilton and
Baulcombe, 1999; Zamore et al., 2000; Elbashir et al., 2001a; Hammond et al., 2001;
Sijen et al., 2001; Catalanotto et al., 2002). In contrast, miRNAs are the products of
endogenous, non-coding genes whose precursor RNA transcripts can form small
stem-loops from which mature miRNAs are cleaved by Dicer (Lagos-Quintana et al., 2001;
Lau et al., 2001; Lee and Ambros, 2001; Lagos-Quintana et al., 2002; Mourelatos et al.,
2002; Reinhart et al., 2002; Ambros et al., 2003; Brennecke et al., 2003; Lagos-Quintana
et al., 2003; Lim et al., 2003a; Lim et al., 2003b). miRNAs are encoded in genes distinct
from the mRNAs whose expression they control.

siRNAs were first identified as the specificity determinants of the RNA
interference (RNAi) pathway (Hamilton and Baulcombe, 1999; Hammond et al., 2000),
where they act as guides to direct endonucleolytic cleavage of their target RNAs
(Zamore et al., 2000; Elbashir et al., 2001a). Prototypical siRNA duplexes are 21 nt,
double-stranded RNAs that contain 19 base pairs, with two-nucleotide, 3' overhanging
ends (Elbashir et al., 2001a; Nykänen et al., 2001; Tang et al., 2003). Active siRNAs
contain 5' phosphates and 3' hydroxyls (Zamore et al., 2000; Boutla et al., 2001;
Nykänen et al., 2001; Chiu and Rana, 2002). Similarly, miRNAs contain 5' phosphate
and 3' hydroxyl groups, reflecting their production by Dicer (Hutvágner et al., 2001;
Mallory et al., 2002).

In plants, miRNAs regulate the expression of developmentally important
proteins, often by directing mRNA cleavage (Rhoades et al., 2002; Reinhart et al., 2002;
Llave et al., 2002a; Llave et al., 2002b; Xie et al., 2003; Kasschau et al., 2003; Tang et
al., 2003; Chen, 2003). Whereas plant miRNAs show a high degree of complementarity
to their mRNA targets, animal miRNAs have only limited complementarity to the
mRNAs whose expression they control (Lee et al., 1993; Wightman et al., 1993; Olsen
and Ambros, 1999; Reinhart et al., 2000; Slack et al., 2000; Abrahante et al., 2003;
Brennecke et al., 2003; Lin et al., 2003; Xu et al., 2003). Animal miRNAs are thought
to repress mRNA translation, rather than promote target mRNA destruction (Lee et al.,
1993; Wightman et al., 1993; Olsen and Ambros, 1999; Brennecke et al., 2003).
Recent evidence suggests that the two classes of small RNAs are functionally
interchangeable, with the choice of mRNA cleavage or translational repression
determined solely by the degree of complementarity between the small RNA and its
target (Hutvágner and Zamore, 2002; Doench et al., 2003). Furthermore, siRNAs and
miRNAs are found in similar, if not identical, complexes, suggesting that a single,
bifunctional complex, the RNA-induced silencing complex (RISC), mediates both
cleavage and translational control (Mourelatos et al., 2002; Hutvágner and Zamore,
2002; Caudy et al., 2002; Martinez et al., 2002). Nonetheless, studies in both plants and
animals show that at steady-state, siRNAs and miRNAs differ in at least one crucial
respect: in vivo and in vitro, siRNAs are double-stranded, whereas miRNAs are
single-stranded (Lee et al., 1993; Hamilton and Baulcombe, 1999; Pasquinelli et al., 2000;
Reinhart et al., 2000; Elbashir et al., 2001a; Djikeng et al., 2001; Nykänen et al., 2001;
Lagos-Quintana et al., 2001; Lau et al., 2001; Lee and Ambros, 2001; Lagos-Quintana et
al., 2002; Reinhart et al., 2002; Llave et al., 2002a; Silhavy et al., 2002; Llave et al.,
2002b; Tang et al., 2003).

siRNA duplexes can assemble into RISC in the absence of target mRNA, both in
vivo and in vitro (Tuschl et al., 1999; Hammond et al., 2000; Zamore et al., 2000). Each
RISC contains only one of the two strands of the siRNA duplex (Martinez et al., 2002).
Since siRNA duplexes have no foreknowledge of which siRNA strand will guide target
cleavage, both strands must assemble with the appropriate proteins to form a RISC.
Previously, we and others showed that both siRNA strands are competent to direct RNAi
(Tuschl et al., 1999; Hammond et al., 2000; Zamore et al., 2000; Elbashir et al., 2001b;
Elbashir et al., 2001a; Nykänen et al., 2001). That is, the anti-sense strand of an siRNA
can direct cleavage of a corresponding sense RNA target, whereas the sense siRNA
strand directs cleavage of an anti-sense target. In this way, siRNA duplexes appear to be
functionally symmetric. The ability to control which strand of an siRNA duplex enters
into the RISC complex to direct cleavage of a corresponding RNA target would provide
a significant advance for both research and therapeutic applications of RNAi technology.

Summary of the Invention 

A key step in RNA interference (RNAi) is the assembly of a catalytically active
protein-RNA complex, the RNA-induced silencing complex (RISC), that mediates target
RNA cleavage. The instant invention is based, at least in part, on the discovery that the
two strands of a siRNA duplex do not contribute equally to RISC assembly. Rather, both
the absolute and the relative stabilities of the base pairs at the 5' ends of the two siRNA
strands determine the degree to which each strand participates in the RNAi pathway. In
fact, siRNA can be functionally asymmetric, with only one of the two strands able to
trigger RNAi. The present invention is also based on the discovery that single-stranded
miRNAs are initially generated as siRNA-like duplexes whose structures predestine one
strand to enter the RISC and the other strand to be destroyed. This finding helps to
explain the biogenesis of single-stranded miRNAs; the miRNA strand of a short-lived,
siRNA duplex-like intermediate is assembled into a RISC complex, causing miRNAs to
accumulate in vivo as single-stranded RNAs.

The present invention is further based on the discovery that RISC can cleave
RNA targets with up to five contiguous mismatches at the siRNA 5' end and eight
mismatches at the siRNA 3' end, indicating that 5' bases contribute disproportionately to
target RNA binding, but do not play a role in determining the catalytic rate, kcat. This
finding explains how the 5', central and 3' sequences of the siRNA guide strand function
to direct target cleavage.

The invention is further based on the discovery that the 3' bases of the siRNA
contribute much less than 5' bases to the overall strength of binding, but instead help to
establish the helical geometry required for RISC-mediated target cleavage, consistent
with the view that catalysis by RISC requires a central A-form helix (Chiu et al., 2003).
This finding indicates that complementarity is essential for translational repression by
siRNAs designed to act like animal miRNAs, which typically repress translation
(Doench et al., 2004).

The present invention is further based on the discovery that when an siRNA fails
to pair with the first three, four or five nucleotides of the target RNA, the phosphodiester
bond severed in the target RNA is unchanged; for perfectly matched siRNA, RISC
measures the site of cleavage from the siRNA 5' end (Elbashir et al., 2001; Elbashir et
al., 2001). This finding indicates that the identity of the scissile phosphate is determined
prior to the encounter of the RISC with its target RNA, perhaps because the RISC
endonuclease is positioned with respect to the siRNA 5' end during RISC assembly.

Accordingly, the instant invention features methods of enhancing the efficacy
and specificity of RNAi. Also provided is a method of decreasing silencing of an
inadvertent target by an RNAi agent. The invention further features compositions,
including siRNAs, shRNAs, as well as vectors and transgenes, for mediating RNAi. The
RNAi agents of the invention have improved specificity and efficacy in mediating
silencing of a target gene.

Other features and advantages of the invention will be apparent from the
following detailed description and claims.
Brief Description of the Drawings

Figure 1. RNAi mediated by asymmetric duplex and single-stranded siRNAs.
(A) Schematic showing relevant portions of the sense and anti-sense target RNA
sequences. (B) Schematic showing siRNA duplex sequence and graph depicting RNAi
mediated by the antisense and sense strands. (C) Schematic showing siRNA sequences
of individual single strands and graph depicting RNAi mediated by the single strands.
(D) Bar graph depicting fraction of total siRNA present as single-strand. (E) Schematic
showing siRNA duplex sequence containing G:U wobble base pair and graph depicting
RNAi mediated by the antisense and sense strands.

Figure 2. RNAi mediated by asymmetric duplex siRNAs. (A) Schematic
showing relevant portions of the sense and anti-sense target RNA sequences. (B)
Schematic showing siRNA duplex sequence and graph depicting RNAi mediated by the
antisense and sense strands. (C) Schematic showing siRNA duplex sequence containing
A:U mismatch and graph depicting RNAi mediated by the antisense and sense strands.
(D) Schematic showing siRNA duplex sequence containing G:U mismatch and graph
depicting RNAi mediated by the antisense and sense strands. (E) Schematic showing
siRNA duplex sequence containing C:A mismatch and graph depicting RNAi mediated
by the antisense and sense strands.

Figure 3. RNAi mediated by asymmetric duplex siRNAs. (A) Schematic
showing relevant portions of the sense and anti-sense target RNA sequences. (B)
Schematic showing siRNA duplex sequence and graph depicting RNAi mediated by the
antisense and sense strands. (C) Schematic showing siRNA duplex sequence containing
A:G mismatch and graph depicting RNAi mediated by the antisense and sense strands.
(D) Schematic showing siRNA duplex sequence containing C:U mismatch and graph
depicting RNAi mediated by the antisense and sense strands. (E) Schematic showing
siRNA duplex sequence containing A:U base pair and graph depicting RNAi mediated
by the antisense and sense strands. (F) Schematic showing siRNA duplex sequence
containing A:G mismatch and graph depicting RNAi mediated by the antisense and
sense strands. (G) Schematic showing siRNA duplex sequence containing C:U
mismatch and graph depicting RNAi mediated by the antisense and sense strands. (H)
Schematic showing siRNA duplex sequence containing A:U base pair and graph
depicting RNAi mediated by the antisense and sense strands. (I) Schematic of
individual single-strands of siRNAs and graph depicting RNAi mediated by the
individual single-strands.

Figure 4. RNAi mediated by asymmetric duplex siRNAs containing inosine. (A)
Schematic showing siRNA duplex sequence having inosine at 5' end of sense strand and
graph depicting RNAi mediated by the antisense and sense strands. (B) Schematic
showing siRNA duplex sequence having inosine at 5' end of antisense strand and graph
depicting RNAi mediated by the antisense and sense strands. (C) Schematic showing
siRNA duplex sequence containing inosine in both strands and graph depicting RNAi
mediated by the antisense and sense strands. (D) Schematic showing individual siRNA
strands containing inosine and graph depicting RNAi mediated by the individual
single-strands.

Figure 5. Symmetric cleavage of pre-let-7 by Dicer. (A) Analysis of cleavage
products produced on 5' side of precursor stem (let-7). (B) Analysis of cleavage
products produced on 3' side of precursor stem (let-7*). (C) Conceptual dicing of
pre-let-7 to a deduced pre-let-7 siRNA.

Figure 6. Analysis of Drosophila miRNA genes for predicted miRNA and
miRNA*. (A) Conceptual dicing of 26 published Drosophila miRNA genes to a deduced
duplex siRNA. (B) Amounts of miR-10 and miR-10* detected in vivo.

Figure 7. Schematic representing mechanism of RISC assembly from
pre-miRNA or dsRNA.

Figure 8. Reduction of off-target silencing by sense strand. (A) Sense and
anti-sense sod1 target RNA sequences. (B) Schematic showing siRNA duplex sequence and
graph depicting RNAi mediated by the antisense and sense strands. (C) Schematic
showing siRNA duplex sequence containing G:U wobble base pair and graph depicting
RNAi mediated by the antisense and sense strands. (D) Schematic showing individual
siRNA strands and graph depicting RNAi mediated by the individual single-strands. (E)
Thermodynamic analysis of siRNA strand 5' ends for the siRNA duplex in (B). ΔG
(kcal/mole) was calculated in 1 M NaCl at

Figure 9. Enhancement of silencing by antisense strand. (A) Schematic
showing relevant portions of the sense and anti-sense target RNA sequences. (B)
Schematic showing siRNA duplex sequence and graph depicting RNAi mediated by the
antisense and sense strands. (B) Schematic showing siRNA duplex sequence containing
A:U base pair and graph depicting RNAi mediated by the antisense and sense strands.
(C) Schematic showing siRNA duplex sequence containing A:G mismatch and graph
depicting RNAi mediated by the antisense and sense strands. (D) Thermodynamic
analysis of siRNA strand 5' ends for the siRNA duplex in (B). ΔG (kcal/mole) was
calculated in 1 M NaCl at 37°C.

Figure 10. The relative thermodynamic stability of the first four base pairs
of the siRNA strands explains siRNA functional asymmetry. Thermodynamic analysis
of siRNA strand 5' ends for the siRNAs in Figures 1B and 1E. ΔG (kcal/mole) was
calculated in 1 M NaCl at

Figure 11. The first four base pairs of the siRNA duplex determine
strand-specific activity. Internal, single-nucleotide mismatches (A-F) near the 5' ends of an
siRNA strand generate functional asymmetry, but internal G:U wobble pairs (G-I) do
not.

Figure 12. Increased rate of siRNA efficiency when duplexes have dTdT
mismatched tails.

Figure 13. Product release limits the rate of catalysis by RISC. (a) ATP
stimulates multiple rounds of RISC cleavage of the RNA target. siRNA was incubated
with ATP in Drosophila embryo lysate, then NEM was added to quench RISC assembly
and to disable the ATP-regenerating system. The energy regenerating system was either
restored by adding additional creatine kinase (+ATP) or the reaction was ATP-depleted
by adding hexokinase and glucose (-ATP). The target RNA concentration was 49 nM
and the concentration of RISC was ~4 nM. The siRNA sequence is given in Figure 21.
(b) In the absence of ATP, cleavage by RISC produces a pre-steady-state burst equal,
within error, to the concentration of active RISC. The target concentration was 110 nM
and the RISC concentration was ~4 nM. (c) Catalysis by RISC is not enhanced by ATP
under single-turnover conditions. RISC was present in ~S-fold excess over target. Each
data point represents the average of two trials.

Figure 14. In the absence of ATP, mismatches between the 3' end of the siRNA
guide strand and the target RNA facilitate product release, but reduce the rate of target
cleavage. (a) Representative siRNA sequences are shown aligned with the target
sequence. The siRNA guide strand is in color (5' to 3') and the mismatch with the target
site is highlighted in yellow. A complete list of siRNA sequences appears in Figure 21.
(b) The steady-state rate of cleavage in the presence and absence of ATP was determined
for siRNAs with zero to four 3' mismatches with the target site. The target RNA
concentration was 49 nM and the concentration of RISC was either ~4 nM (no
mismatches) or ~6 nM (1 to 4 mismatches). The steady-state velocity with ATP, relative
to the velocity without ATP, is shown for each siRNA. (c) Time course of cleavage for
perfectly matched (~16-fold excess of RISC relative to target) and mismatched (~80-fold
excess of RISC) siRNA. (d) Data representative of those used in the analysis in (c) for
target cleavage directed by siRNAs with zero, four, and five 3' mismatches.

Figure 15. Remarkable tolerance of RISC for 3' mismatches. (a) Each additional
3' mismatch further reduced the rate of cleavage by RISC. The steady-state rates of
cleavage were determined for siRNA with zero, one, two, and four mismatches under
multiple-turnover conditions (~49 nM target mRNA and ~4-6 nM RISC). (b) Analysis
of siRNAs bearing zero to five 3' mismatches with the target RNA under conditions of
slight enzyme excess (~2-fold more RISC than target). siRNA sequences used in (a) and
(b) are shown in Figure 14A and Figure 21. (c) Extended endpoint analysis of RISC
cleavage under conditions of ~80-fold enzyme excess reveals that cleavage can occur for
siRNAs with as many as eight mismatches to the target RNA. Note the different time
scales in (c) versus (b). All reactions were under standard in vitro RNAi (+ATP)
conditions.

Figure 16. Limited tolerance of RISC for 5' mismatches. (a) RISC cleavage was
analyzed as in Figure 21C using 5' mismatched siRNAs, whose sequences are given in
Figure 21. The target RNA was the same for all siRNAs. (b) RISC cleavage was
analyzed using a single siRNA sequence. Mismatches were created by altering the
sequence of the target RNA. For the target containing compensatory mutations, the
target concentration was 0.25 nM and the siRNA concentration was ~20 nM; RISC
concentration was not determined. The asterisk denotes a 15 second time-point. (c) RISC
cleavage was analyzed by incubating 50 nM siRNA with 0.5 nM target RNA. 3'
mismatches were created by modifying the target sequence, and 5' mismatches by
changing the siRNA. Target and siRNA sequences are given in Supplementary Figure 3.
(d) Perfectly base-paired and 5' mismatched siRNAs direct cleavage at the same
phosphodiester bond. Cleavage reactions were performed with ~20 nM RISC generated
from 50 nM siRNA and 0.5 nM target RNA and analyzed on an 8% denaturing
polyacrylamide sequencing gel. The target mRNA was 182 nt and 5' cleavage product
was 148 nt. After RISC was assembled, the extract was treated with NEM to inactivate
nucleases (Schwartz et al., 2004). After NEM treatment, the ATP regenerating system
was restored by adding additional creatine kinase, then target RNA was added and the
incubation continued for the indicated time. OH- denotes a base hydrolysis ladder.

Figure 17. Michaelis-Menten and Ki analysis for matched and mismatched
siRNAs reveal distinct contributions to binding and catalysis for the 5', central, and 3'
regions of the siRNA. (a) siRNA was assembled into RISC under standard in vitro RNAi
conditions, then diluted to achieve the desired RISC concentration. The initial rates of
cleavage were determined for increasing concentrations of 5' 32P-cap-radiolabeled target
mRNA. Plot of initial velocity versus substrate concentration. KM and Vmax were
determined by fitting the data to the Michaelis-Menten equation. See Table 1 for
analysis. Representative initial rate determinations appear in Figure 20A. (b) Ki values
were determined in competition assays using 2'-O-methyl oligonucleotides bearing 5',
central, and 3' mismatches to the siRNA guide strand. Representative data are presented
in Figure 20B, and a complete list of the 2'-O-methyl oligonucleotides used appears in
Figure 21.

Figure 18. A model for the cycle of RISC assembly, target recognition, catalysis,
and recycling.

Figure 19. Exogenously programmed RISC is a bona fide enzyme. siRNA was
assembled into RISC for 1 hour in a standard in vitro RNAi reaction, then assembly was
quenched with N-ethyl maleimide (NEM)21,29. The amount of RISC formed was
determined by measuring 32P-radiolabeled siRNA retained on a tethered 5'-biotinylated,
31-nt, 2'-O-methyl oligonucleotide complementary to the guide strand of the siRNA.
RISC binds essentially irreversibly to tethered 2'-O-methyl oligonucleotides, but cannot
cleave these RNA-analogs (Hutvágner et al., 2004; Schwartz et al., 2003). In all
experiments, target-cleaving activity was not detected in the supernatant, demonstrating
that all the active RISC was retained on the beads. (a) Sequence of the siRNA used
(guide strand in red, 32P-radiolabel marked with an asterisk). Drosophila let-7 is not
expressed in 0-2 hour embryos (Hutvágner et al., 2001), so the only source of let-7 in
the in vitro reactions was the exogenous let-7 siRNA. The 5' end of the guide strand of
the let-7 siRNA is predicted to be thermodynamically more stable than the 5' end of the
passenger strand, explaining why only a low concentration of let-7-programmed RISC
is formed (Schwartz et al., 2003; Khvorova et al., 2003). The maximum amount of RISC
assembled varies widely with siRNA sequence. The siRNAs used in Figures 3-8 were
designed to load =S-fold more guide strand-containing RISC (Hutvágner et al., 2001;
Schwartz et al., 2003). (b) Representative gel confirming that the RISC was removed by
the tethered 2'-O-methyl oligonucleotide. A reaction prior to incubation with the
tethered 2'-O-methyl oligonucleotide (pre) was compared to the supernatant of a
reaction incubated with beads alone (mock), and the supernatant of a reaction incubated
with the complementary tethered 2'-O-methyl oligonucleotide (post). The buffer
reaction contained no siRNA. (c) Analysis of the amount of RISC assembled at various
siRNA concentrations. 5' 32P-radiolabeled siRNA was incubated with lysate for 1 hour,
then reactions were quenched by treatment with NEM, and RISC concentration was
measured using the tethered 2'-O-methyl oligonucleotide method.

Figure 20. Michaelis-Menten and Competitor Analysis of RISC. (a)
Representative data for the determination of initial velocities for the perfectly matched
siRNA. Black, 1 nM target; red, 5 nM; blue, 20 nM; and green, 60 nM. (b) Three
independent experiments for inhibition by a fully complementary 2'-O-methyl
oligonucleotide competitor. ~1 nM RISC and 5 nM 32P-cap-radiolabeled target mRNA
were incubated with increasing concentration of competitor, and the initial velocities
were calculated and plotted versus competitor concentration.

Figure 21. siRNAs, target sites, and 2'-O-methyl oligonucleotides used in this
study.

Table 1. Kinetic analysis of RISC.
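
By way of illustration only, the Michaelis-Menten analysis described for Figure 17 can be sketched as follows. The sketch is not the fitting procedure used to produce the figures or Table 1: it assumes a Hanes-Woolf linearization for simplicity, and the KM and Vmax values are hypothetical placeholders used only to generate noise-free example rates at the 1, 5, 20 and 60 nM target concentrations listed for Figure 20.

```python
def michaelis_menten(s, vmax, km):
    """Initial velocity v = Vmax * [S] / (KM + [S])."""
    return vmax * s / (km + s)


def fit_hanes_woolf(substrate, velocity):
    """Estimate (Vmax, KM) from the straight line [S]/v = [S]/Vmax + KM/Vmax.

    A simple least-squares line is fitted to ([S], [S]/v); the slope is
    1/Vmax and the intercept is KM/Vmax.
    """
    xs = list(substrate)
    ys = [s / v for s, v in zip(substrate, velocity)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km


# Target concentrations (nM) taken from the Figure 20 legend.
TARGET_NM = [1.0, 5.0, 20.0, 60.0]

# Hypothetical kinetic constants, chosen only to generate example data.
ASSUMED_VMAX = 0.5   # nM per second (assumption)
ASSUMED_KM = 10.0    # nM (assumption)

rates = [michaelis_menten(s, ASSUMED_VMAX, ASSUMED_KM) for s in TARGET_NM]
fitted_vmax, fitted_km = fit_hanes_woolf(TARGET_NM, rates)
print(f"Vmax = {fitted_vmax:.3f} nM/s, KM = {fitted_km:.3f} nM")
```

With measured rather than simulated initial rates, a direct nonlinear fit to the Michaelis-Menten equation would generally be preferred, because linearized forms weight experimental error unevenly.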

Detailed Description of the Invention

A key step in RNA interference (RNAi) is the assembly of a catalytically active
protein-RNA complex, the RNA-induced silencing complex (RISC), that mediates target
RNA cleavage. Each RISC contains one of the two strands of the small interfering RNA
(siRNA) duplex that triggers RNAi. The instant invention is based, at least in part, on
the discovery that the two siRNA strands do not contribute equally to RISC assembly.
Small changes in siRNA sequence were found to have profound and predictable effects
on the extent to which the two strands of an siRNA duplex enter the RNAi pathway, a
phenomenon termed siRNA functional "asymmetry". The discoveries described herein
reveal that the strength of the base-pairing interactions made by the 5' end of each
siRNA strand with the 3' region of the strand to which it is paired determines which of the
two strands participates in the RNAi pathway. RISC assembly appears to be governed
by an enzyme that initiates unwinding of an siRNA duplex at the siRNA strand whose 5'
end is less tightly paired to the complementary siRNA strand.
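
By way of illustration only, the comparison of 5' end pairing strength described above can be sketched in Python. The sketch rests on stated assumptions: the per-pair scores are arbitrary ordinal values (G:C above A:U above G:U wobble above any other pairing) rather than the thermodynamic ΔG values shown in the figures, the window of four terminal base pairs follows the Figure 10 legend, and the 19-nt duplex core sequence is hypothetical.

```python
# Arbitrary ordinal scores (an assumption for illustration, not measured
# free energies): higher means a more tightly paired position.
PAIR_SCORE = {
    frozenset("GC"): 3,   # Watson-Crick G:C
    frozenset("AU"): 2,   # Watson-Crick A:U
    frozenset("GU"): 1,   # G:U wobble
}                         # any other pairing scores 0 (mismatch)

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the perfectly complementary strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def end_score(strand, partner, n=4):
    """Score the first n base pairs counted from the 5' end of `strand`.

    Both arguments are the paired duplex cores (overhangs removed),
    written 5' to 3', so strand[k] pairs with partner[len - 1 - k].
    """
    last = len(strand) - 1
    return sum(
        PAIR_SCORE.get(frozenset((strand[k], partner[last - k])), 0)
        for k in range(n)
    )


def favored_strand(sense_core, antisense_core, n=4):
    """Name the strand whose 5' end is less tightly paired."""
    sense = end_score(sense_core, antisense_core, n)
    antisense = end_score(antisense_core, sense_core, n)
    if antisense < sense:
        return "antisense"
    if sense < antisense:
        return "sense"
    return "neither"


# Hypothetical 19-nt sense core: G:C-rich at its 5' end, A:U-rich at its 3' end.
SENSE_CORE = "GCGCAUUCAGGCUAAUAUA"
ANTISENSE_CORE = reverse_complement(SENSE_CORE)

print(favored_strand(SENSE_CORE, ANTISENSE_CORE))
```

In this hypothetical duplex the antisense 5' end sits in the A:U-rich region, so the sketch names the antisense strand as the one expected to be favored for entry into RISC. A nearest-neighbor free energy calculation would be a closer match to the thermodynamic analysis referred to in the figure legends.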

Remarkably, such highly asymmetric siRNA duplexes resemble proposed
intermediates in the biogenesis pathway of microRNA (miRNA) (Hutvágner and
Zamore, 2002; Reinhart et al., 2002; Lim et al., 2003b). miRNAs are endogenous,
~21-nt single-stranded RNAs processed by Dicer from stem-loop RNA precursors that
regulate gene expression in animals and plants. A striking feature of miRNA precursors
is their lack of full complementarity in the stem region. The discoveries presented
herein indicate an important role for the discontinuities in the stem region of miRNAs;
it is likely that miRNAs are initially generated from their precursor RNAs as siRNA-like
duplexes, and that the structure of these duplexes predestines the miRNA strand to enter
the RISC and the other strand to be destroyed. Thus, nature appears to have optimized
the stem portion of miRNAs to follow a set of rules dictating which strand enters the
RISC complex.

The discoveries made by the instant inventors provide rules according to which
siRNAs and shRNAs can be designed that are fully asymmetric, with only one of the
two siRNA strands competent to enter the RISC complex. By applying these rules to the
selection and design of a targeted RNAi agent, e.g., siRNAs and shRNAs, the antisense
strand of the RNAi agent can be predictably directed to enter the RISC complex and
mediate target RNA cleavage. Similarly, the sense strand can be discouraged from
entering the RISC complex, thereby reducing or eliminating undesired silencing of an
inadvertent target by the sense strand.

Accordingly, the instant invention provides methods for improving the efficiency
(or specificity) of an RNAi reaction comprising identifying an off-target RNAi activity
mediated by the sense strand of an RNAi agent, and modifying the RNAi agent such that
the base pair strength between the 5' end of the antisense strand and the 3' end of the
sense strand is lessened relative to the base pair strength of the 5' end of the sense strand
and the 3' end of the antisense strand (e.g., relative to the premodified RNAi agent),
such that the sense strand is less effective at entering RISC (e.g., less effective than the
premodified RNAi agent).

The instant invention also provides methods for improving the efficiency (or
specificity) of an RNAi reaction comprising modifying (e.g., increasing) the asymmetry
of the RNAi agent such that the ability of the sense or second strand to mediate RNAi
(e.g., mediate cleavage of target RNA) is lessened. In preferred embodiments, the
asymmetry is increased in favor of the 5' end of the first strand, e.g., lessening the bond
strength (e.g., the strength of the interaction) between the 5' end of the first strand and 3'
end of the second strand relative to the bond strength (e.g., the strength of the
interaction) between the 5' end of the second strand and the 3' end of the first strand.
In other embodiments, the asymmetry is increased in favor of the 5' end of the first
strand by increasing bond strength (e.g., the strength of the interaction) between the 5'
end of the second or sense strand and the 3' end of the first or antisense strand, relative
to the bond strength (e.g., the strength of the interaction) between the 5' end of the first
and the 3' end of the second strand. In embodiments of the invention, the bond strength
is increased, e.g., the H bonding is increased between nucleotides or analogs at the 5'
end, e.g., within 5 nucleotides of the second or sense strand (numbered from the 5' end
of the second strand) and complementary nucleotides of the first or antisense strand. It is
understood that the asymmetry can be zero (i.e., no asymmetry), for example, when the
bonds or base pairs between the 5' and 3' terminal bases are of the same nature, strength
or structure. More routinely, however, there exists some asymmetry due to the different
nature, strength or structure of at least one nucleotide (often one or more nucleotides)
between terminal nucleotides or nucleotide analogs.

Accordingly, in one aspect, the instant invention provides a method of enhancing
the ability of a first strand of a RNAi agent to act as a guide strand in mediating RNAi,
involving lessening the base pair strength between the 5' end of the first strand and the
3' end of a second strand of the duplex as compared to the base pair strength between
the 3' end of the first strand and the 5' end of the second strand.

In a related aspect, the invention provides a method of enhancing the efficacy of
a siRNA duplex, the siRNA duplex comprising a sense and an antisense strand,
involving lessening the base pair strength between the antisense strand 5' end (AS 5')
and the sense strand 3' end (S 3') as compared to the base pair strength between the
antisense strand 3' end (AS 3') and the sense strand 5' end (S 5'), such that efficacy is
enhanced.

In another aspect of the invention, a method is provided for promoting entry of a
desired strand of an siRNA duplex into a RISC complex, comprising enhancing the
asymmetry of the siRNA duplex, such that entry of the desired strand is promoted. In
one embodiment of this aspect of the invention, the asymmetry is enhanced by lessening
the base pair strength between the 5' end of the desired strand and the 3' end of a
complementary strand of the duplex as compared to the base pair strength between the
3' end of the desired strand and the 5' end of the complementary strand.

In another aspect of the invention, a siRNA duplex is provided comprising a
sense strand and an antisense strand, wherein the base pair strength between the
antisense strand 5' end (AS 5') and the sense strand 3' end (S 3') is less than the base
pair strength between the antisense strand 3' end (AS 3') and the sense strand 5' end
(S 5'), such that the antisense strand preferentially guides cleavage of a target mRNA.

In one embodiment of these aspects of the invention, the base-pair strength is
less due to fewer G:C base pairs between the 5' end of the first or antisense strand and
the 3' end of the second or sense strand than between the 3' end of the first or antisense
strand and the 5' end of the second or sense strand.

In another embodiment, the base pair strength is less due to at least one
mismatched base pair between the 5' end of the first or antisense strand and the 3' end of
the second or sense strand. Preferably, the mismatched base pair is selected from the
group consisting of G:A, C:A, C:U, G:G, A:A, C:C and U:U.

In one embodiment, the base pair strength is less due to at least one wobble base
pair, e.g., G:U, between the 5' end of the first or antisense strand and the 3' end of the
second or sense strand.

In another embodiment, the base pair strength is less due to at least one base pair
comprising a rare nucleotide, e.g., inosine (I). Preferably, the base pair is selected from
the group consisting of an I:A, I:U and I:C.

In yet another embodiment, the base pair strength is less due to at least one base
pair comprising a modified nucleotide. In preferred embodiments, the modified
nucleotide is selected from the group consisting of 2-amino-G, 2-amino-A, 2,6-diamino-G,
and 2,6-diamino-A.

In several embodiments of these aspects of the invention, the RNAi agent is an
siRNA duplex or is derived from an engineered precursor, and can be chemically
synthesized or enzymatically synthesized.

In another aspect of the instant invention, compositions are provided comprising
an siRNA duplex of the invention formulated to facilitate entry of the siRNA duplex into
a cell. Also provided are pharmaceutical compositions comprising an siRNA duplex of the
invention.

Further provided are an engineered pre-miRNA comprising the siRNA duplex of
any one of the preceding claims, as well as a vector encoding the pre-miRNA. In related
aspects, the invention provides a pri-miRNA comprising the pre-miRNA, as well as a
vector encoding the pri-miRNA.

Also featured in the instant invention are small hairpin RNAs (shRNAs)
comprising nucleotide sequence identical to the sense and antisense strand of the siRNA
duplex of any one of the preceding claims. In one embodiment, the nucleotide sequence
identical to the sense strand is upstream of the nucleotide sequence identical to the
antisense strand. In another embodiment, the nucleotide sequence identical to the
antisense strand is upstream of the nucleotide sequence identical to the sense strand.
Further provided are vectors and transgenes encoding the shRNAs of the invention.

In yet another aspect, the invention provides cells comprising the vectors
featured in the instant invention. Preferably, the cell is a mammalian cell, e.g., a human
cell.

In other aspects of the invention, methods are provided for enhancing silencing of a
target mRNA, comprising contacting a cell having an RNAi pathway with the RNAi agent of
any one of the preceding claims under conditions such that silencing is enhanced.

Also provided are methods of enhancing silencing of a target mRNA in a subject,
comprising administering to the subject a pharmaceutical composition comprising the
RNAi agent of any one of the preceding claims such that silencing is enhanced.

Further provided is a method of decreasing silencing of an inadvertent target
mRNA by a dsRNAi agent, the dsRNAi agent comprising a sense strand and an
antisense strand, involving the steps of: (a) detecting a significant degree of
complementarity between the sense strand and the inadvertent target; and (b) enhancing
the base pair strength between the 5' end of the sense strand and the 3' end of the
antisense strand relative to the base pair strength between the 3' end of the sense strand
and the 5' end of the antisense strand; such that silencing of the inadvertent target
mRNA is decreased. In a preferred embodiment, the silencing of the inadvertent target
mRNA is decreased relative to silencing of a desired target mRNA.

So that the invention may be more readily understood, certain terms are first
defined.

The term "nucleoside" refers to a molecule having a purine or pyrimidine base
covalently linked to a ribose or deoxyribose sugar. Exemplary nucleosides include
adenosine, guanosine, cytidine, uridine and thymidine. The term "nucleotide" refers to a
nucleoside having one or more phosphate groups joined in ester linkages to the sugar
moiety. Exemplary nucleotides include nucleoside monophosphates, diphosphates and
triphosphates. The terms "polynucleotide" and "nucleic acid molecule" are used
interchangeably herein and refer to a polymer of nucleotides joined together by a
phosphodiester linkage between 5' and 3' carbon atoms.

The term "RNA" or "RNA molecule" or "ribonucleic acid molecule" refers to a
polymer of ribonucleotides. The term "DNA" or "DNA molecule" or "deoxyribonucleic
acid molecule" refers to a polymer of deoxyribonucleotides. DNA and RNA can be
synthesized naturally (e.g., by DNA replication or transcription of DNA, respectively).
RNA can be post-transcriptionally modified. DNA and RNA can also be chemically
synthesized. DNA and RNA can be single-stranded (i.e., ssRNA and ssDNA,
respectively) or multi-stranded (e.g., double-stranded, i.e., dsRNA and dsDNA,
respectively). "mRNA" or "messenger RNA" is single-stranded RNA that specifies the
amino acid sequence of one or more polypeptide chains. This information is translated
during protein synthesis when ribosomes bind to the mRNA.

As used herein, the term "small interfering RNA" ("siRNA") (also referred to in
the art as "short interfering RNAs") refers to an RNA (or RNA analog) comprising
between about 10-50 nucleotides (or nucleotide analogs) which is capable of directing or
mediating RNA interference. Preferably, an siRNA comprises between about 15-30
nucleotides or nucleotide analogs, more preferably between about 16-25 nucleotides (or
nucleotide analogs), even more preferably between about 18-23 nucleotides (or
nucleotide analogs), and even more preferably between about 19-22 nucleotides (or
nucleotide analogs) (e.g., 19, 20, 21 or 22 nucleotides or nucleotide analogs).

As used herein, the term "rare nucleotide" refers to a naturally occurring
nucleotide that occurs infrequently, including naturally occurring deoxyribonucleotides
or ribonucleotides that occur infrequently, e.g., a naturally occurring ribonucleotide that
is not guanosine, adenosine, cytosine, or uridine. Examples of rare nucleotides include,
but are not limited to, inosine, 1-methyl inosine, pseudouridine, 5,6-dihydrouridine,
ribothymidine, 2N-methylguanosine and 2,2N,N-dimethylguanosine.

The term "nucleotide analog" or "altered nucleotide" or "modified nucleotide"
refers to a non-standard nucleotide, including non-naturally occurring ribonucleotides or
deoxyribonucleotides. Preferred nucleotide analogs are modified at any position so as to
alter certain chemical properties of the nucleotide yet retain the ability of the nucleotide
analog to perform its intended function. Examples of preferred modified nucleotides
include, but are not limited to, 2-amino-guanosine, 2-amino-adenosine,
2,6-diamino-guanosine and 2,6-diamino-adenosine. Examples of positions of the nucleotide
which may be derivatized include the 5 position, e.g., 5-(2-amino)propyl uridine, 5-bromo
uridine, 5-propyne uridine, 5-propenyl uridine, etc.; the 6 position, e.g.,
6-(2-amino)propyl uridine; the 8-position for adenosine and/or guanosines, e.g., 8-bromo
guanosine, 8-chloro guanosine, 8-fluoroguanosine, etc. Nucleotide analogs also include
deaza nucleotides, e.g., 7-deaza-adenosine; O- and N-modified (e.g., alkylated,
N6-methyl adenosine, or as otherwise known in the art) nucleotides; and other
heterocyclically modified nucleotide analogs such as those described in Herdewijn,
Antisense Nucleic Acid Drug Dev., 2000 Aug. 10(4):297-310.

Nucleotide analogs may also comprise modifications to the sugar portion of the
nucleotides. For example, the 2' OH-group may be replaced by a group selected from H,
OR, R, F, Cl, Br, I, SH, SR, NH2, NHR, NR2, COOR, or OR, wherein R is substituted or
unsubstituted C1-C6 alkyl, alkenyl, alkynyl, aryl, etc. Other possible modifications
include those described in U.S. Patent Nos. 5,858,988, and 6,291,438.

The phosphate group of the nucleotide may also be modified, e.g., by
substituting one or more of the oxygens of the phosphate group with sulfur (e.g.,
phosphorothioates), or by making other substitutions which allow the nucleotide to
perform its intended function such as described in, for example, Eckstein, Antisense
Nucleic Acid Drug Dev. 2000 Apr. 10(2):117-21, Rusckowski et al. Antisense Nucleic
Acid Drug Dev. 2000 Oct. 10(5):333-45, Stein, Antisense Nucleic Acid Drug Dev. 2001
Oct. 11(5):317-25, Vorobjev et al. Antisense Nucleic Acid Drug Dev. 2001 Apr.
11(2):77-85, and U.S. Patent No. 5,684,143. Certain of the above-referenced
modifications (e.g., phosphate group modifications) preferably decrease the rate of
hydrolysis of, for example, polynucleotides comprising said analogs in vivo or in vitro.

The term "oligonucleotide" refers to a short polymer of nucleotides and/or
nucleotide analogs. The term "RNA analog" refers to a polynucleotide (e.g., a
chemically synthesized polynucleotide) having at least one altered or modified
nucleotide as compared to a corresponding unaltered or unmodified RNA but retaining
the same or similar nature or function as the corresponding unaltered or unmodified
RNA. As discussed above, the oligonucleotides may be linked with linkages which
result in a lower rate of hydrolysis of the RNA analog as compared to an RNA molecule
with phosphodiester linkages. For example, the nucleotides of the analog may comprise
methylenediol, ethylene diol, oxymethylthio, oxyethylthio, oxycarbonyloxy,
phosphorodiamidate, phosphoroamidate, and/or phosphorothioate linkages. Preferred
RNA analogues include sugar- and/or backbone-modified ribonucleotides and/or
deoxyribonucleotides. Such alterations or modifications can further include addition of
non-nucleotide material, such as to the end(s) of the RNA or internally (at one or more
nucleotides of the RNA). An RNA analog need only be sufficiently similar to natural
RNA that it has the ability to mediate (mediates) RNA interference.

As used herein, the term "RNA interference" ("RNAi") (also referred to in the
art as "gene silencing" and/or "target silencing", e.g., "target mRNA silencing") refers to
a selective intracellular degradation of RNA. RNAi occurs in cells naturally to remove
foreign RNAs (e.g., viral RNAs). Natural RNAi proceeds via fragments cleaved from
free dsRNA which direct the degradative mechanism to other similar RNA sequences.
Alternatively, RNAi can be initiated by the hand of man, for example, to silence the
expression of target genes.

As used herein, the term "antisense strand" of an siRNA or RNAi agent refers to
a strand that is substantially complementary to a section of about 10-50 nucleotides, e.g.,
about 15-30, 16-25, 18-23 or 19-22 nucleotides of the mRNA of the gene targeted for
silencing. The antisense strand or first strand has sequence sufficiently complementary
to the desired target mRNA sequence to direct target-specific RNA interference (RNAi),
e.g., complementarity sufficient to trigger the destruction of the desired target mRNA by
the RNAi machinery or process. The term "sense strand" or "second strand" of an
siRNA or RNAi agent refers to a strand that is complementary to the antisense strand or
first strand. Antisense and sense strands can also be referred to as first or second
strands, the first or second strand having complementarity to the target sequence and the
respective second or first strand having complementarity to said first or second strand.

As used herein, the term "guide strand" refers to a strand of an RNAi agent, e.g.,
an antisense strand of an siRNA duplex, that enters into the RISC complex and directs
cleavage of the target mRNA.

A "target gene" is a gene whose expression is to be selectively inhibited or
"silenced." This silencing is achieved by cleaving the mRNA of the target gene by an
siRNA or miRNA, e.g., an siRNA or miRNA that is created from an engineered RNA
precursor by a cell's RNAi system. One portion or segment of a duplex stem of the
RNA precursor is an anti-sense strand that is complementary, e.g., sufficiently
complementary to trigger the destruction of the desired target mRNA by the RNAi
machinery or process, to a section of about 18 to about 40 or more nucleotides of the
mRNA of the target gene.

The term "engineered," as in an engineered RNA precursor, or an engineered
nucleic acid molecule, indicates that the precursor or molecule is not found in nature, in
that all or a portion of the nucleic acid sequence of the precursor or molecule is created
or selected by man. Once created or selected, the sequence can be replicated, translated,
transcribed, or otherwise processed by mechanisms within a cell. Thus, an RNA
precursor produced within a cell from a transgene that includes an engineered nucleic
acid molecule is an engineered RNA precursor.

As used herein, the term "asymmetry", as in the asymmetry of an siRNA duplex,
refers to an inequality of bond strength or base pairing strength between the siRNA
termini (e.g., between terminal nucleotides on a first strand and terminal nucleotides on
an opposing second strand), such that the 5' end of one strand of the duplex is more
frequently in a transient unpaired, e.g., single-stranded, state than the 5' end of the
complementary strand. This structural difference determines that one strand of the
duplex is preferentially incorporated into a RISC complex. The strand whose 5' end is
less tightly paired to the complementary strand will preferentially be incorporated into
RISC and mediate RNAi.
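
The comparison of the two duplex ends described in this definition can be sketched in Python. This is a minimal illustration under stated assumptions, not a thermodynamic model: the scoring table (G:C = 3, A:U = 2, G:U wobble = 1, any other pairing = 0) is an arbitrary stand-in for base pair strength, the window of four terminal base pairs is an assumed choice, the function names `end_scores` and `favored_guide` are hypothetical, and each strand is supplied as its paired region only (overhangs removed), written 5' to 3'.

```python
# Illustrative scores only (an assumption, not measured energies):
# G:C pairs score highest, A:U lower, G:U wobble lower still, mismatches zero.
PAIR_SCORE = {
    frozenset("GC"): 3,
    frozenset("AU"): 2,
    frozenset("GU"): 1,
}


def pair_score(a, b):
    """Return the illustrative score for nucleotides a and b on opposing strands."""
    return PAIR_SCORE.get(frozenset((a, b)), 0)


def end_scores(first_strand, second_strand, n=4):
    """Score the n terminal base pairs at each end of a duplex.

    Both strands are the paired regions only, written 5'->3', so that
    first_strand[i] pairs with second_strand[len - 1 - i].
    Returns (score at first-strand 5' end, score at first-strand 3' end).
    """
    if len(first_strand) != len(second_strand):
        raise ValueError("paired regions must have equal length")
    length = len(first_strand)
    pairs = [(first_strand[i], second_strand[length - 1 - i]) for i in range(length)]
    five_prime = sum(pair_score(a, b) for a, b in pairs[:n])
    three_prime = sum(pair_score(a, b) for a, b in pairs[-n:])
    return five_prime, three_prime


def favored_guide(first_strand, second_strand, n=4):
    """Name the strand whose 5' end is less tightly paired under this scoring."""
    five_prime, three_prime = end_scores(first_strand, second_strand, n)
    if five_prime < three_prime:
        return "first"
    if three_prime < five_prime:
        return "second"
    return "neither"
```

Under this scoring, introducing a mismatch or a G:U wobble opposite the 5' end of the first strand lowers the first value returned by `end_scores`, which corresponds to the strategy of lessening base pair strength at the AS 5' / S 3' end.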

As used herein, the term "bond strength" or "base pair strength" refers to the
strength of the interaction between pairs of nucleotides (or nucleotide analogs) on
opposing strands of an oligonucleotide duplex (e.g., an siRNA duplex), due primarily to
H-bonding, Van der Waals interactions, and the like between said nucleotides (or
nucleotide analogs).

As used herein, the "5' end", as in the 5' end of an antisense strand, refers to the
5' terminal nucleotides, e.g., between one and about 5 nucleotides at the 5' terminus of
the antisense strand. As used herein, the "3' end", as in the 3' end of a sense strand,
refers to the region, e.g., a region of between one and about 5 nucleotides, that is
complementary to the nucleotides of the 5' end of the complementary antisense strand.

As used herein, the term "isolated RNA" (e.g., "isolated shRNA", "isolated
siRNA" or "isolated RNAi agent") refers to RNA molecules which are substantially free
of other cellular material, or culture medium when produced by recombinant techniques,
or substantially free of chemical precursors or other chemicals when chemically
synthesized.

As used herein, the term "transgene" refers to any nucleic acid molecule, which
is inserted by artifice into a cell, and becomes part of the genome of the organism that
develops from the cell. Such a transgene may include a gene that is partly or entirely
heterologous (i.e., foreign) to the transgenic organism, or may represent a gene
homologous to an endogenous gene of the organism. The term "transgene" also means a
nucleic acid molecule that includes one or more selected nucleic acid sequences, e.g.,
DNAs, that encode one or more engineered RNA precursors, to be expressed in a
transgenic organism, e.g., animal, which is partly or entirely heterologous, i.e., foreign,
to the transgenic animal, or homologous to an endogenous gene of the transgenic animal,
but which is designed to be inserted into the animal's genome at a location which differs
from that of the natural gene. A transgene includes one or more promoters and any other
DNA, such as introns, necessary for expression of the selected nucleic acid sequence, all
operably linked to the selected sequence, and may include an enhancer sequence.

The term "in vitro" has its art recognized meaning, e.g., involving purified
reagents or extracts, e.g., cell extracts. The term "in vivo" also has its art recognized
meaning, e.g., involving living cells, e.g., immortalized cells, primary cells, cell lines,
and/or cells in an organism.

A gene "involved" in a disorder includes a gene, the normal or aberrant
expression or function of which effects or causes a disease or disorder or at least one
symptom of said disease or disorder.

Various methodologies of the instant invention include a step that involves
comparing a value, level, feature, characteristic, property, etc. to a "suitable control",
referred to interchangeably herein as an "appropriate control". A "suitable control" or
"appropriate control" is any control or standard familiar to one of ordinary skill in the art
useful for comparison purposes. In one embodiment, a "suitable control" or
"appropriate control" is a value, level, feature, characteristic, property, etc. determined
prior to performing an RNAi methodology, as described herein. For example, a
transcription rate, mRNA level, translation rate, protein level, biological activity, cellular
characteristic or property, genotype, phenotype, etc. can be determined prior to
introducing an RNAi agent of the invention into a cell or organism. In another
embodiment, a "suitable control" or "appropriate control" is a value, level, feature,
characteristic, property, etc. determined in a cell or organism, e.g., a control or normal
cell or organism, exhibiting, for example, normal traits. In yet another embodiment, a
"suitable control" or "appropriate control" is a predefined value, level, feature,
characteristic, property, etc.

Various aspects of the invention are described in further detail in the following
subsections.

I. RNA molecules and agents

The present invention features "small interfering RNA molecules" ("siRNA
molecules" or "siRNA"), methods of making said siRNA molecules and methods (e.g.,
research and/or therapeutic methods) for using said siRNA molecules. An siRNA
molecule of the invention is a duplex consisting of a sense strand and complementary
antisense strand, the antisense strand having sufficient complementarity to a target
mRNA to mediate RNAi. Preferably, the strands are aligned such that there are at least
1, 2, or 3 bases at the end of the strands which do not align (i.e., for which no
complementary bases occur in the opposing strand) such that an overhang of 1, 2 or 3
residues occurs at one or both ends of the duplex when strands are annealed. Preferably,
the siRNA molecule has a length from about 10-50 or more nucleotides, i.e., each strand
comprises 10-50 nucleotides (or nucleotide analogs). More preferably, the siRNA
molecule has a length from about 15-45 or 15-30 nucleotides. Even more preferably, the
siRNA molecule has a length from about 16-25 or 18-23 nucleotides. The siRNA
molecules of the invention further have a sequence that is "sufficiently complementary"
to a target mRNA sequence to direct target-specific RNA interference (RNAi), as
defined herein, i.e., the siRNA has a sequence sufficient to trigger the destruction of the
target mRNA by the RNAi machinery or process.

siRNAs featured in the invention provide enhanced specificity and efficacy for
mediating RISC-mediated cleavage of a desired target gene. In a preferred aspect, the
base pair strength between the antisense strand 5' end (AS 5') and the sense strand 3'
end (S 3') of the siRNAs is less than the bond strength or base pair strength between the
antisense strand 3' end (AS 3') and the sense strand 5' end (S 5'), such that the antisense
strand preferentially guides cleavage of a target mRNA. In one embodiment, the bond
strength or base pair strength is less due to fewer G:C base pairs between the 5' end of
the first or antisense strand and the 3' end of the second or sense strand than between the
3' end of the first or antisense strand and the 5' end of the second or sense strand.

In another embodiment, the bond strength or base pair strength is less due to at
least one mismatched base pair between the 5' end of the first or antisense strand and the
3' end of the second or sense strand. Preferably, the mismatched base pair is selected
from the group consisting of G:A, C:A, C:U, G:G, A:A, C:C and U:U. In a related
embodiment, the bond strength or base pair strength is less due to at least one wobble
base pair, e.g., G:U, between the 5' end of the first or antisense strand and the 3' end of
the second or sense strand.

In yet another embodiment, the bond strength or base pair strength is less due to
at least one base pair comprising a rare nucleotide, e.g., inosine (I). Preferably, the base
pair is selected from the group consisting of an I:A, I:U and I:C.

In yet another embodiment, the bond strength or base pair strength is less due to
at least one base pair comprising a modified nucleotide. In preferred embodiments, the
modified nucleotide is selected from the group consisting of 2-amino-G, 2-amino-A,
2,6-diamino-G, and 2,6-diamino-A.

In general, siRNA containing nucleotide sequences sufficiently identical to a
portion of the target gene to effect RISC-mediated cleavage of the target gene are
preferred. 100% sequence identity between the siRNA and the target gene is not
required to practice the present invention. The invention has the advantage of being able
to tolerate preferred sequence variations of the methods and compositions of the
invention in order to enhance efficiency and specificity of RNAi. For example, siRNA
sequences with insertions, deletions, and single point mutations relative to the target
sequence can also be effective for inhibition. Alternatively, siRNA sequences with
nucleotide analog substitutions or insertions can be effective for inhibition.

Sequence identity may be determined by sequence comparison and alignment
algorithms known in the art. To determine the percent identity of two nucleic acid
sequences (or of two amino acid sequences), the sequences are aligned for optimal
comparison purposes (e.g., gaps can be introduced in the first sequence or second
sequence for optimal alignment). The nucleotides (or amino acid residues) at
corresponding nucleotide (or amino acid) positions are then compared. When a position
in the first sequence is occupied by the same residue as the corresponding position in the
second sequence, then the molecules are identical at that position. The percent identity
between the two sequences is a function of the number of identical positions shared by
the sequences (i.e., % homology = # of identical positions/total # of positions x 100),
optionally penalizing the score for the number of gaps introduced and/or length of gaps
introduced.
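
The percent identity calculation stated above can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned by some other tool and are supplied as equal-length strings with "-" marking gaps; it takes "total # of positions" to mean the number of alignment columns, which is one possible reading; and it applies no additional gap penalty. The function name `percent_identity` is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Implements: # of identical positions / total # of positions x 100,
    where the total is taken to be the number of alignment columns
    (an assumption) and a gap never counts as an identical position.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)
```

For example, an siRNA antisense strand that matches its target at 19 of 21 aligned positions would score roughly 90% under this definition, which falls within the preferred range of greater than 80% identity.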

The comparison of sequences and determination of percent identity between two
sequences can be accomplished using a mathematical algorithm. In one embodiment,
the alignment is generated over a certain portion of the sequence aligned having sufficient
identity but not over portions having low degree of identity (i.e., a local alignment). A
preferred, non-limiting example of a local alignment algorithm utilized for the
comparison of sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl.
Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul (1993) Proc. Natl. Acad.
Sci. USA 90:5873-77. Such an algorithm is incorporated into the BLAST programs
(version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10.

In another embodiment, the alignment is optimized by introducing appropriate
gaps and percent identity is determined over the length of the aligned sequences (i.e., a
gapped alignment). To obtain gapped alignments for comparison purposes, Gapped
BLAST can be utilized as described in Altschul et al. (1997) Nucleic Acids Res.
25(17):3389-3402. In another embodiment, the alignment is optimized by introducing
appropriate gaps and percent identity is determined over the entire length of the
sequences aligned (i.e., a global alignment). A preferred, non-limiting example of a
mathematical algorithm utilized for the global comparison of sequences is the algorithm
of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the
ALIGN program (version 2.0) which is part of the GCG sequence alignment software
package. When utilizing the ALIGN program for comparing amino acid sequences, a
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be
used.

Greater than 80% sequence identity, e.g., 80%, 81%, 82%, 83%, 84%, 85%,
86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or even
100% sequence identity, between the siRNA antisense strand and the portion of the
target gene is preferred. Alternatively, the siRNA may be defined functionally as a
nucleotide sequence (or oligonucleotide sequence) that is capable of hybridizing with a
portion of the target gene transcript (e.g., 400 mM NaCl, 40 mM PIPES pH 6.4, 1 mM
EDTA, 50°C or 70°C hybridization for 12-16 hours; followed by washing). Additional
preferred hybridization conditions include hybridization at 70°C in 1xSSC or 50°C in
1xSSC, 50% formamide followed by washing at 70°C in 0.3xSSC or hybridization at
70°C in 4xSSC or 50°C in 4xSSC, 50% formamide followed by washing at 67°C in
1xSSC. The hybridization temperature for hybrids anticipated to be less than 50 base
pairs in length should be 5-10°C less than the melting temperature (Tm) of the hybrid,
where Tm is determined according to the following equations. For hybrids less than 18
base pairs in length, Tm(°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids
between 18 and 49 base pairs in length, Tm(°C) = 81.5 + 16.6(log10[Na+]) +
0.41(%G+C) - (600/N), where N is the number of bases in the hybrid, and [Na+] is the
concentration of sodium ions in the hybridization buffer ([Na+] for 1xSSC = 0.165 M).
Additional examples of stringency conditions for polynucleotide hybridization are
provided in Sambrook, J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et
al., eds., John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by
reference. The length of the identical nucleotide sequences may be at least about 10, 12,
15, 17, 20, 22, 25, 27, 30, 32, 35, 37, 40, 42, 45, 47 or 50 bases.
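
The two melting temperature equations given above can be evaluated with the following Python sketch. It is illustrative only: the function names `melting_temperature` and `hybridization_range` are hypothetical, the default sodium concentration of 0.165 M is the 1xSSC value stated above, and counting U together with T in the "A + T" term is an assumption made here so that RNA sequences can be passed directly.

```python
import math


def melting_temperature(seq, na_molar=0.165):
    """Estimate Tm (degrees C) of a hybrid from one strand's sequence.

    Uses the short-hybrid rule for fewer than 18 bases and the
    salt-adjusted equation for 18 to 49 bases.
    """
    seq = seq.upper()
    n = len(seq)
    gc = seq.count("G") + seq.count("C")
    # Assumption: U is counted with A and T so RNA strands can be used.
    at = seq.count("A") + seq.count("T") + seq.count("U")
    if n == 0:
        raise ValueError("sequence must not be empty")
    if n < 18:
        return 2.0 * at + 4.0 * gc
    if n <= 49:
        percent_gc = 100.0 * gc / n
        return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * percent_gc - 600.0 / n
    raise ValueError("equations apply only to hybrids shorter than 50 bases")


def hybridization_range(tm):
    """Return (low, high) temperatures that are 10 and 5 degrees C below Tm."""
    return tm - 10.0, tm - 5.0
```

Because the salt term uses the base-10 logarithm of the sodium concentration, lowering [Na+] below 1 M reduces the estimated Tm, so the value should be recomputed for the buffer actually used.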

The RNA molecules of the present invention can be modified to improve
stability in serum or in growth medium for cell cultures. In order to enhance the
stability, the 3'-residues may be stabilized against degradation, e.g., they may be selected
such that they consist of purine nucleotides, particularly adenosine or guanosine
nucleotides. Alternatively, substitution of pyrimidine nucleotides by modified
analogues, e.g., substitution of uridine by 2'-deoxythymidine is tolerated and does not
affect the efficiency of RNA interference.

In a preferred aspect, the invention features small interfering RNAs (siRNAs)
that include a sense strand and an antisense strand, wherein the antisense strand has a
sequence sufficiently complementary to a target mRNA sequence to direct
target-specific RNA interference (RNAi) and wherein the sense strand and/or antisense strand
is modified by the substitution of internal nucleotides with modified nucleotides, such
that in vivo stability is enhanced as compared to a corresponding unmodified siRNA.
As defined herein, an "internal" nucleotide is one occurring at any position other than
the 5' end or 3' end of a nucleic acid molecule, polynucleotide or oligonucleotide. An
internal nucleotide can be within a single-stranded molecule or within a strand of a
duplex or double-stranded molecule. In one embodiment, the sense strand and/or
antisense strand is modified by the substitution of at least one internal nucleotide. In
another embodiment, the sense strand and/or antisense strand is modified by the
substitution of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,
22, 23, 24, 25 or more internal nucleotides. In another embodiment, the sense strand
and/or antisense strand is modified by the substitution of at least 5%, 10%, 15%, 20%,
25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or
more of the internal nucleotides. In yet another embodiment, the sense strand and/or
antisense strand is modified by the substitution of all of the internal nucleotides.

In a preferred embodiment of the present invention the RNA molecule may
contain at least one modified nucleotide analogue. The nucleotide analogues may be
located at positions where the target-specific activity, e.g., the RNAi mediating activity,
is not substantially affected, e.g., in a region at the 5'-end and/or the 3'-end of the RNA
molecule. Particularly, the ends may be stabilized by incorporating modified nucleotide
analogues.

Preferred nucleotide analogues include sugar- and/or backbone-modified
ribonucleotides (i.e., include modifications to the phosphate-sugar backbone). For
example, the phosphodiester linkages of natural RNA may be modified to include at
least one of a nitrogen or sulfur heteroatom. In preferred backbone-modified
ribonucleotides the phosphoester group connecting to adjacent ribonucleotides is
replaced by a modified group, e.g., a phosphothioate group. In preferred
sugar-modified ribonucleotides, the 2' OH-group is replaced by a group selected from H, OR,
R, halo, SH, SR, NH2, NHR, NR2 or ON, wherein R is C1-C6 alkyl, alkenyl or alkynyl
and halo is F, Cl, Br or I.

Also preferred are nucleobase-modified ribonucleotides, i.e., ribonucleotides
containing at least one non-naturally occurring nucleobase instead of a naturally
occurring nucleobase. Bases may be modified to block the activity of adenosine
deaminase. Exemplary modified nucleobases include, but are not limited to, uridine
and/or cytidine modified at the 5-position, e.g., 5-(2-amino)propyl uridine, 5-bromo
uridine; adenosine and/or guanosines modified at the 8 position, e.g., 8-bromo
guanosine; deaza nucleotides, e.g., 7-deaza-adenosine; O- and N-alkylated nucleotides,
e.g., N6-methyl adenosine are suitable. It should be noted that the above modifications
may be combined.

RNA may be produced enzymatically or by partial/total organic synthesis; any
modified ribonucleotide can be introduced by in vitro enzymatic or organic synthesis.
In one embodiment, an RNAi agent is prepared chemically. Methods of synthesizing
RNA molecules are known in the art, in particular, the chemical synthesis methods as
described in Verma and Eckstein (1998) Annu. Rev. Biochem. 67:99-134. In another
embodiment, a ss-siRNA is prepared enzymatically. For example, a ds-siRNA can be
prepared by enzymatic processing of a long ds RNA having sufficient complementarity
to the desired target mRNA. Processing of long ds RNA can be accomplished in vitro,
for example, using appropriate cellular lysates and ds-siRNAs can be subsequently
purified by gel electrophoresis or gel filtration. ds-siRNA can then be denatured
according to art-recognized methodologies. In an exemplary embodiment, RNA can be
purified from a mixture by extraction with a solvent or resin, precipitation,
electrophoresis, chromatography, or a combination thereof. Alternatively, the RNA may
be used with no or a minimum of purification to avoid losses due to sample processing.
Alternatively, the siRNA can also be prepared by enzymatic transcription from synthetic
DNA templates or from DNA plasmids isolated from recombinant bacteria. Typically,
phage RNA polymerases are used such as T7, T3 or SP6 RNA polymerase (Milligan and
Uhlenbeck (1989) Methods Enzymol. 180:51-62). The RNA may be dried for storage or
dissolved in an aqueous solution. The solution may contain buffers or salts to inhibit
annealing, and/or promote stabilization of the single strands.

In one embodiment, the target mRNA of the invention specifies the amino acid
sequence of a cellular protein (e.g., a nuclear, cytoplasmic, transmembrane, or
membrane-associated protein). In another embodiment, the target mRNA of the
invention specifies the amino acid sequence of an extracellular protein (e.g., an
extracellular matrix protein or secreted protein). As used herein, the phrase "specifies
the amino acid sequence" of a protein means that the mRNA sequence is translated into
the amino acid sequence according to the rules of the genetic code. The following
classes of proteins are listed for illustrative purposes: developmental proteins (e.g.,
adhesion molecules, cyclin kinase inhibitors, Wnt family members, Pax family
members, Winged helix family members, Hox family members, cytokines/lymphokines
and their receptors, growth/differentiation factors and their receptors, neurotransmitters
and their receptors); oncogene-encoded proteins (e.g., ABL1, BCL1, BCL2, BCL6,
CBFA2, CBL, CSF1R, ERBA, ERBB, ERBB2, ETS1, ETS1, ETV6, FGR, FOS, FYN,
HCR, HRAS, JUN, KRAS, LCK, LYN, MDM2, MLL, MYB, MYC, MYCL1, MYCN,
NRAS, PIM1, PML, RET, SRC, TAL1, TCL3, and YES); tumor suppressor proteins
(e.g., APC, BRCA1, BRCA2, MADH4, MCC, NF1, NF2, RB1, TP53, and WT1); and
enzymes (e.g., ACC synthases and oxidases, ACP desaturases and hydroxylases, ADP-
glucose pyrophosphorylases, ATPases, alcohol dehydrogenases, amylases,
amyloglucosidases, catalases, cellulases, chalcone synthases, chitinases,
cyclooxygenases, decarboxylases, dextrinases, DNA and RNA polymerases,
galactosidases, glucanases, glucose oxidases, granule-bound starch synthases, GTPases,
helicases, hemicellulases, integrases, inulinases, invertases, isomerases, kinases,
lactases, lipases, lipoxygenases, lysozymes, nopaline synthases, octopine synthases,
pectinesterases, peroxidases, phosphatases, phospholipases, phosphorylases, phytases,
plant growth regulator synthases, polygalacturonases, proteinases and peptidases,
pullulanases, recombinases, reverse transcriptases, RUBISCOs, topoisomerases, and
xylanases).

In a preferred aspect of the invention, the target mRNA molecule of the
invention specifies the amino acid sequence of a protein associated with a pathological
condition. For example, the protein may be a pathogen-associated protein (e.g., a viral
protein involved in immunosuppression of the host, replication of the pathogen,
transmission of the pathogen, or maintenance of the infection), or a host protein which
facilitates entry of the pathogen into the host, drug metabolism by the pathogen or host,
replication or integration of the pathogen's genome, establishment or spread of infection
in the host, or assembly of the next generation of pathogen. Alternatively, the protein
may be a tumor-associated protein or an autoimmune disease-associated protein.

In one embodiment, the target mRNA molecule of the invention specifies the
amino acid sequence of an endogenous protein (i.e., a protein present in the genome of a
cell or organism). In another embodiment, the target mRNA molecule of the invention
specifies the amino acid sequence of a heterologous protein expressed in a recombinant
cell or a genetically altered organism. In another embodiment, the target mRNA
molecule of the invention specifies the amino acid sequence of a protein encoded by a
transgene (i.e., a gene construct inserted at an ectopic site in the genome of the cell). In
yet another embodiment, the target mRNA molecule of the invention specifies the amino
acid sequence of a protein encoded by a pathogen genome which is capable of infecting
a cell or an organism from which the cell is derived.

By inhibiting the expression of such proteins, one may obtain valuable information
regarding the function of said proteins, as well as therapeutic benefits from said
inhibition.

In one embodiment, siRNAs are synthesized either in vivo, in situ, or in vitro.
Endogenous RNA polymerase of the cell may mediate transcription in vivo or in situ, or
cloned RNA polymerase can be used for transcription in vivo or in vitro. For
transcription from a transgene in vivo or an expression construct, a regulatory region
(e.g., promoter, enhancer, silencer, splice donor and acceptor, polyadenylation) may be
used to transcribe the ss-siRNA. Inhibition may be targeted by specific transcription in
an organ, tissue, or cell type; stimulation of an environmental condition (e.g., infection,
stress, temperature, chemical inducers); and/or engineering transcription at a
developmental stage or age. A transgenic organism that expresses ss-siRNA from a
recombinant construct may be produced by introducing the construct into a zygote, an
embryonic stem cell, or another multipotent cell derived from the appropriate organism.

II. Short hairpin RNAs (shRNAs)

In certain featured embodiments, the instant invention provides shRNAs having
enhanced specificity or efficacy in mediating RNAi. In contrast to short siRNA
duplexes, short hairpin RNAs (shRNAs) mimic the natural precursors of miRNAs and
enter at the top of the RNAi pathway. For this reason, shRNAs are believed to mediate
RNAi more efficiently by being fed through the entire natural RNAi pathway.

A preferred shRNA of the invention is one that has been redesigned for increased
specificity or enhancement relative to a previous shRNA. The new shRNA differs from
a previous shRNA in that an siRNA duplex produced from the new shRNA has less base
pair strength between the 5' end of the antisense strand or first strand and the 3' end of
the sense strand or second strand than the base pair strength between the 3' end of the
antisense strand or first strand and the 5' end of the sense strand or second strand.
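
As a purely illustrative aid, the end-asymmetry criterion described above can be sketched in Python. The scoring weights (G:C = 3, A:U = 2, G:U = 1, any other pairing = 0), the four-base-pair window, and the function names are assumptions introduced here for illustration and are not values taken from this specification; a nearest-neighbor thermodynamic calculation could be substituted for the simple score.

```python
# Illustrative pairing scores (assumed weights, not from the specification).
PAIR_SCORE = {
    ("G", "C"): 3, ("C", "G"): 3,
    ("A", "U"): 2, ("U", "A"): 2,
    ("G", "U"): 1, ("U", "G"): 1,  # wobble pair, treated as weaker
}


def end_strength(antisense: str, sense: str, window: int = 4) -> tuple[int, int]:
    """Return (score at antisense 5' end, score at antisense 3' end).

    Both strands are written 5'->3', have equal length, and are assumed
    to be fully aligned with any overhangs already removed.
    """
    if len(antisense) != len(sense):
        raise ValueError("strands must have equal length")
    # Base i of the antisense strand pairs with base -(i + 1) of the sense strand.
    pairs = list(zip(antisense, reversed(sense)))

    def score(ps):
        return sum(PAIR_SCORE.get(p, 0) for p in ps)  # mismatches score 0

    return score(pairs[:window]), score(pairs[-window:])


def favors_antisense(antisense: str, sense: str, window: int = 4) -> bool:
    """True when the antisense 5' end is more weakly paired than its 3' end."""
    five_prime, three_prime = end_strength(antisense, sense, window)
    return five_prime < three_prime
```

For example, `favors_antisense("AAUUCCGGCCGG", "CCGGCCGGAAUU")` compares an A:U-rich antisense 5' end against a G:C-rich 3' end under these assumed weights.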

1. Engineered RNA Precursors That Generate siRNAs 

Naturally-occurring miRNA precursors (pre-miRNA) have a single strand that
forms a duplex stem including two portions that are generally complementary, and a
loop, that connects the two portions of the stem. In typical pre-miRNAs, the stem
includes one or more bulges, e.g., extra nucleotides that create a single nucleotide "loop"
in one portion of the stem, and/or one or more unpaired nucleotides that create a gap in
the hybridization of the two portions of the stem to each other. Short hairpin RNAs, or
engineered RNA precursors, of the invention are artificial constructs based on these
naturally occurring pre-miRNAs, but which are engineered to deliver desired siRNAs.

In shRNAs, or engineered precursor RNAs, of the instant invention, one portion
of the duplex stem is a nucleic acid sequence that is complementary (or anti-sense) to the
target mRNA. Thus, engineered RNA precursors include a duplex stem with two
portions and a loop connecting the two stem portions. The two stem portions are about
18 or 19 to about 25, 30, 35, 37, 38, 39, or 40 or more nucleotides in length. When used
in mammalian cells, the length of the stem portions should be less than about 30
nucleotides to avoid provoking non-specific responses like the interferon pathway. In
non-mammalian cells, the stem can be longer than 30 nucleotides. In fact, the stem can
include much larger sections complementary to the target mRNA (up to, and
including the entire mRNA). The two portions of the duplex stem must be sufficiently
complementary to hybridize to form the duplex stem. Thus, the two portions can be, but
need not be, fully or perfectly complementary. In addition, the two stem portions can be
the same length, or one portion can include an overhang of 1, 2, 3, or 4 nucleotides. The
overhanging nucleotides can include, for example, uracils (Us), e.g., all Us. The loop in
the shRNAs or engineered RNA precursors may differ from natural pre-miRNA
sequences by modifying the loop sequence to increase or decrease the number of paired
nucleotides, or replacing all or part of the loop sequence with a tetraloop or other loop
sequences. Thus, the loop in the shRNAs or engineered RNA precursors can be 2, 3, 4,
5, 6, 7, 8, 9, or more, e.g., 15 or 20, or more nucleotides in length.
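
The stem, loop, and overhang parameters described above can be illustrated with a minimal Python sketch that assembles a perfectly paired hairpin from a target-derived sequence. The function names, the sense-loop-antisense ordering, the placeholder tetraloop "UUCG", and the default "UU" overhang are assumptions made here for illustration only; the specification permits other orders, imperfect pairing, bulges, and other loop sequences.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return rna.translate(COMPLEMENT)[::-1]


def build_hairpin(target: str, loop: str = "UUCG", overhang: str = "UU",
                  mammalian: bool = True) -> str:
    """Assemble a hypothetical hairpin: sense stem + loop + antisense stem + overhang.

    `target` is a stretch of the target mRNA (sense orientation, 5'->3').
    """
    target = target.upper().replace("T", "U")
    if len(target) < 18:
        raise ValueError("stem portions are about 18 nucleotides or longer")
    if mammalian and len(target) >= 30:
        # Stems of about 30 nt or more may provoke non-specific responses.
        raise ValueError("use a stem shorter than about 30 nt in mammalian cells")
    if len(loop) < 2:
        raise ValueError("loop should be at least 2 nucleotides")
    if len(overhang) > 4:
        raise ValueError("overhang is 1 to 4 nucleotides (or empty)")
    antisense = reverse_complement(target)
    return target + loop + antisense + overhang
```

Calling `build_hairpin` with a 19-nucleotide target yields a 44-nucleotide hairpin under these defaults (19 + 4 + 19 + 2).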

shRNAs of the invention include the sequences of the desired siRNA duplex.
The desired siRNA duplex, and thus both of the two stem portions in the engineered
RNA precursor, are selected by methods known in the art. These include, but are not
limited to, selecting an 18, 19, 20, 21 nucleotide, or longer, sequence from the target
gene mRNA sequence from a region 100 to 200 or 300 nucleotides on the 3' side of the
start of translation. In general, the sequence can be selected from any portion of the
mRNA from the target gene, such as the 5' UTR (untranslated region), coding sequence,
or 3' UTR. This sequence can optionally follow immediately after a region of the target
gene containing two adjacent AA nucleotides. The last two nucleotides of the 21 or so
nucleotide sequence can be selected to be UU (so that the anti-sense strand of the siRNA
begins with UU). This 21 or so nucleotide sequence is used to create one portion of a
duplex stem in the engineered RNA precursor. This sequence can replace a stem portion
of a wild-type pre-stRNA sequence, e.g., enzymatically, or is included in a complete
sequence that is synthesized. For example, one can synthesize DNA oligonucleotides
that encode the entire stem-loop engineered RNA precursor, or that encode just the
portion to be inserted into the duplex stem of the precursor, and use restriction
enzymes to build the engineered RNA precursor construct from a wild-type pre-
stRNA.

Engineered RNA precursors include in the duplex stem the 21-22 or so
nucleotide sequences of the siRNA desired to be produced in vivo. Thus, the stem
portion of the engineered RNA precursor includes at least 18 or 19 nucleotide pairs
corresponding to the sequence of an exonic portion of the gene whose expression is to be
reduced or inhibited. The two 3' nucleotides flanking this region of the stem are chosen
so as to maximize the production of the siRNA from the engineered RNA precursor, and
to maximize the efficacy of the resulting siRNA in targeting the corresponding mRNA
for destruction by RNAi in vivo and in vitro.
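
One of the selection approaches mentioned above (choosing a roughly 21-nucleotide window that immediately follows an AA dinucleotide and lies at least about 100 nucleotides 3' of the start of translation) can be sketched as follows. This is a minimal illustration only; the function name, the 0-based coordinate convention, and the default offset and window length are assumptions, and any real design would apply further criteria.

```python
def candidate_targets(mrna: str, start_codon_index: int, length: int = 21,
                      min_offset: int = 100) -> list[tuple[int, str]]:
    """List (0-based position, sequence) of windows that directly follow 'AA'.

    Only windows beginning at least `min_offset` nucleotides 3' of the
    start codon position are reported.
    """
    mrna = mrna.upper().replace("T", "U")
    hits = []
    first = max(start_codon_index + min_offset, 2)
    for i in range(first, len(mrna) - length + 1):
        # The two nucleotides immediately 5' of the window must be AA.
        if mrna[i - 2:i] == "AA":
            hits.append((i, mrna[i:i + length]))
    return hits
```

Each reported window is in the sense orientation; its reverse complement would supply the anti-sense stem portion.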

Another defining feature of these engineered RNA precursors is that as a
consequence of their length, sequence, and/or structure, they do not induce sequence
non-specific responses, such as induction of the interferon response or apoptosis, or that
they induce a lower level of such sequence non-specific responses than long, double-
stranded RNA (>150bp) that has been used to induce RNAi. For example, the interferon
response is triggered by dsRNA longer than 30 base pairs.

2. Transgenes Encoding Engineered RNA Precursors

The new engineered RNA precursors can be synthesized by standard methods
known in the art, e.g., by use of an automated DNA synthesizer (such as are
commercially available from Biosearch, Applied Biosystems, etc.). These synthetic,
engineered RNA precursors can be used directly as described below or cloned into
expression vectors by methods known in the field. The engineered RNA precursors
should be delivered to cells in vitro or in vivo in which it is desired to target a specific
mRNA for destruction. A number of methods have been developed for delivering DNA
or RNA to cells. For example, for in vivo delivery, molecules can be injected directly
into a tissue site or administered systemically. In vitro delivery includes methods known
in the art such as electroporation and lipofection.

To achieve intracellular concentrations of the nucleic acid molecule sufficient to
suppress expression of endogenous mRNAs, one can use, for example, a recombinant
DNA construct in which the oligonucleotide is placed under the control of a strong Pol
III (e.g., U6 or Pol III H1-RNA promoter) or Pol II promoter. The use of such a construct
to transfect target cells in vitro or in vivo will result in the transcription of sufficient
amounts of the engineered RNA precursor to lead to the production of an siRNA that
can target a corresponding mRNA sequence for cleavage by RNAi to decrease the
expression of the gene encoding that mRNA. For example, a vector can be introduced in
vivo such that it is taken up by a cell and directs the transcription of an engineered RNA
precursor. Such a vector can remain episomal or become chromosomally integrated, as
long as it can be transcribed to produce the desired stRNA precursor.

Such vectors can be constructed by recombinant DNA technology methods
known in the art. Vectors can be plasmid, viral, or other vectors known in the art such
as those described herein, used for replication and expression in mammalian cells or
other targeted cell types. The nucleic acid sequences encoding the engineered RNA
precursors can be prepared using known techniques. For example, two synthetic DNA
oligonucleotides can be synthesized to create a novel gene encoding the entire
engineered RNA precursor. The DNA oligonucleotides, which will pair, leaving
appropriate 'sticky ends' for cloning, can be inserted into a restriction site in a plasmid
that contains a promoter sequence (e.g., a Pol II or a Pol III promoter) and appropriate
terminator sequences 3' to the engineered RNA precursor sequences (e.g., a cleavage and
polyadenylation signal sequence from SV40 or a Pol III terminator sequence).

The invention also encompasses genetically engineered host cells that contain
any of the foregoing expression vectors and thereby express the nucleic acid molecules
of the invention in the host cell. The host cells can be cultured using known techniques
and methods (see, Culture of Animal Cells (R.I. Freshney, Alan R. Liss, Inc. 1987);
Molecular Cloning, Sambrook et al. (Cold Spring Harbor Laboratory Press, 1989)).

Successful introduction of the vectors of the invention into host cells can be
monitored using various known methods. For example, transient transfection can be
signaled with a reporter, such as a fluorescent marker, such as Green Fluorescent Protein
(GFP). Stable transfection can be indicated using markers that provide the transfected
cell with resistance to specific environmental factors (e.g., antibiotics and drugs), such
as hygromycin B resistance, e.g., in insect cells and in mammalian cells.

3. Regulatory Sequences

The expression of the engineered RNA precursors is driven by regulatory
sequences, and the vectors of the invention can include any regulatory sequences known
in the art to act in mammalian cells, e.g., human or murine cells; in insect cells; in plant
cells; or other cells. The term regulatory sequence includes promoters, enhancers, and
other expression control elements. It will be appreciated that the appropriate regulatory
sequence depends on such factors as the future use of the cell or transgenic animal into
which a sequence encoding an engineered RNA precursor is being introduced, and the
level of expression of the desired RNA precursor. A person skilled in the art would be
able to choose the appropriate regulatory sequence. For example, the transgenic animals
described herein can be used to determine the role of a test polypeptide or the engineered
RNA precursors in a particular cell type, e.g., a hematopoietic cell. In this case, a
regulatory sequence that drives expression of the transgene ubiquitously, or a
hematopoietic-specific regulatory sequence that expresses the transgene only in
hematopoietic cells, can be used. Expression of the engineered RNA precursors in a
hematopoietic cell means that the cell is now susceptible to specific, targeted RNAi of a
particular gene. Examples of various regulatory sequences are described below.

The regulatory sequences can be inducible or constitutive. Suitable constitutive
regulatory sequences include the regulatory sequence of a housekeeping gene such as
the β-actin regulatory sequence, or may be of viral origin such as regulatory sequences
derived from mouse mammary tumor virus (MMTV) or cytomegalovirus (CMV).

Alternatively, the regulatory sequence can direct transgene expression in specific
organs or cell types (see, e.g., Lasko et al., 1992, Proc. Natl. Acad. Sci. USA 89:6232).
Several tissue-specific regulatory sequences are known in the art including the albumin
regulatory sequence for liver (Pinkert et al., 1987, Genes Dev. 1:268-276); the endothelin
regulatory sequence for endothelial cells (Lee, 1990, J. Biol. Chem. 265:10446-50); the
keratin regulatory sequence for epidermis; the myosin light chain-2 regulatory sequence
for heart (Lee et al., 1992, J. Biol. Chem. 267:15875-85), and the insulin regulatory
sequence for pancreas (Bucchini et al., 1986, Proc. Natl. Acad. Sci. USA 83:2511-2515),
or the vav regulatory sequence for hematopoietic cells (Ogilvy et al., 1999, Proc. Natl.
Acad. Sci. USA 96:14943-14948). Another suitable regulatory sequence, which directs
constitutive expression of transgenes in cells of hematopoietic origin, is the murine
MHC class I regulatory sequence (Morello et al., 1986, EMBO J. 5:1877-1882). Since
MHC expression is induced by cytokines, expression of a test gene operably linked to
this regulatory sequence can be upregulated in the presence of cytokines.

In addition, expression of the transgene can be precisely regulated, for example,
by using an inducible regulatory sequence and expression systems such as a regulatory
sequence that is sensitive to certain physiological regulators, e.g., circulating glucose
levels, or hormones (Docherty et al., 1994, FASEB J. 8:20-24). Such inducible
expression systems, suitable for the control of transgene expression in cells or in
mammals such as mice, include regulation by ecdysone, by estrogen, progesterone,
tetracycline, chemical inducers of dimerization, and isopropyl-beta-D-1-
thiogalactopyranoside (IPTG) (collectively referred to as "the regulatory molecule").
Each of these expression systems is well described in the literature and permits
expression of the transgene throughout the animal in a manner controlled by the
presence or absence of the regulatory molecule. For a review of inducible expression
systems, see, e.g., Mills, 2001, Genes Devel. 15:1461-1467, and references cited therein.

The regulatory elements referred to above include, but are not limited to, the
cytomegalovirus hCMV immediate early gene, the early or late promoters of SV40
adenovirus (Bernoist et al., Nature, 290:304, 1981), the tet system, the lac system, the
trp system, the TAC system, the TRC system, the major operator and promoter regions
of phage λ, the control regions of fd coat protein, the promoter for 3-phosphoglycerate
kinase, the promoters of acid phosphatase, and the promoters of the yeast α-mating
factors. Additional promoters include the promoter contained in the 3' long terminal
repeat of Rous sarcoma virus (Yamamoto et al., Cell 22:787-797, 1988); the herpes
thymidine kinase promoter (Wagner et al., Proc. Natl. Acad. Sci. USA 78:1441, 1981);
or the regulatory sequences of the metallothionein gene (Brinster et al., Nature 296:39,
1988).

4. Assay for Testing Engineered RNA Precursors

Drosophila embryo lysates can be used to determine if an engineered RNA
precursor was, in fact, the direct precursor of a mature stRNA or siRNA. This lysate
assay is described in Tuschl et al., 1999, supra, Zamore et al., 2000, supra, and
Hutvágner et al., 2001, supra. These lysates recapitulate RNAi in vitro, thus permitting
investigation into whether the proposed precursor RNA was cleaved into a mature
stRNA or siRNA by an RNAi-like mechanism. Briefly, the precursor RNA is incubated
with Drosophila embryo lysate for various times, then assayed for the production of the
mature siRNA or stRNA by primer extension or Northern hybridization. As in the in
vivo setting, mature RNA accumulates in the cell-free reaction. Thus, an RNA
corresponding to the proposed precursor can be shown to be converted into a mature
stRNA or siRNA duplex in the Drosophila embryo lysate.
Furthermore, an engineered RNA precursor can be functionally tested in the
Drosophila embryo lysates. In this case, the engineered RNA precursor is incubated in
the lysate in the presence of a 5' radiolabeled target mRNA in a standard in vitro RNAi
reaction for various lengths of time. The target mRNA can be 5' radiolabeled using
guanylyl transferase (as described in Tuschl et al., 1999, supra and references therein) or
other suitable methods. The products of the in vitro reaction are then isolated and
analyzed on a denaturing acrylamide or agarose gel to determine if the target mRNA has
been cleaved in response to the presence of the engineered RNA precursor in the
reaction. The extent and position of such cleavage of the mRNA target will indicate if
the engineering of the precursor created a pre-siRNA capable of mediating sequence-
specific RNAi.

III. Methods of Introducing RNAs, Vectors, and Host Cells

Physical methods of introducing nucleic acids include injection of a solution
containing the RNA, bombardment by particles covered by the RNA, soaking the cell or
organism in a solution of the RNA, or electroporation of cell membranes in the presence
of the RNA. A viral construct packaged into a viral particle would accomplish both
efficient introduction of an expression construct into the cell and transcription of RNA
encoded by the expression construct. Other methods known in the art for introducing
nucleic acids to cells may be used, such as lipid-mediated carrier transport, chemical-
mediated transport, such as calcium phosphate, and the like. Thus the RNA may be
introduced along with components that perform one or more of the following activities:
enhance RNA uptake by the cell, inhibit annealing of single strands, stabilize the single
strands, or otherwise increase inhibition of the target gene.

RNA may be directly introduced into the cell (i.e., intracellularly); or introduced
extracellularly into a cavity, interstitial space, into the circulation of an organism,
introduced orally, or may be introduced by bathing a cell or organism in a solution
containing the RNA. Vascular or extravascular circulation, the blood or lymph system,
and the cerebrospinal fluid are sites where the RNA may be introduced.

The cell with the target gene may be derived from or contained in any organism.
The organism may be a plant, animal, protozoan, bacterium, virus, or fungus. The plant
may be a monocot, dicot or gymnosperm; the animal may be a vertebrate or invertebrate.
Preferred microbes are those used in agriculture or by industry, and those that are
pathogenic for plants or animals. Fungi include organisms in both the mold and yeast
morphologies. Plants include arabidopsis; field crops (e.g., alfalfa, barley, bean, corn,
cotton, flax, pea, rape, rice, rye, safflower, sorghum, soybean, sunflower, tobacco, and
wheat); vegetable crops (e.g., asparagus, beet, broccoli, cabbage, carrot, cauliflower,
celery, cucumber, eggplant, lettuce, onion, pepper, potato, pumpkin, radish, spinach,
squash, taro, tomato, and zucchini); fruit and nut crops (e.g., almond, apple, apricot,
banana, blackberry, blueberry, cacao, cherry, coconut, cranberry, date, feijoa, filbert,
grape, grapefruit, guava, kiwi, lemon, lime, mango, melon, nectarine, orange, papaya,
passion fruit, peach, peanut, pear, pineapple, pistachio, plum, raspberry, strawberry,
tangerine, walnut, and watermelon); and ornamentals (e.g., alder, ash, aspen, azalea,
birch, boxwood, camellia, carnation, chrysanthemum, elm, fir, ivy, jasmine, juniper, oak,
palm, poplar, pine, redwood, rhododendron, rose, and rubber). Examples of vertebrate
animals include fish, mammal, cattle, goat, pig, sheep, rodent, hamster, mouse, rat,
primate, and human; invertebrate animals include nematodes, other worms, drosophila,
and other insects.

The skilled artisan will appreciate that the enumerated organisms are also useful
for practicing other aspects of the invention, e.g., making transgenic organisms as
described infra.

The cell having the target gene may be from the germ line or somatic, totipotent
or pluripotent, dividing or non-dividing, parenchyma or epithelium, immortalized or
transformed, or the like. The cell may be a stem cell or a differentiated cell. Cell types
that are differentiated include adipocytes, fibroblasts, myocytes, cardiomyocytes,
endothelium, neurons, glia, blood cells, megakaryocytes, lymphocytes, macrophages,
neutrophils, eosinophils, basophils, mast cells, leukocytes, granulocytes, keratinocytes,
chondrocytes, osteoblasts, osteoclasts, hepatocytes, and cells of the endocrine or
exocrine glands.

Depending on the particular target gene and the dose of double stranded RNA
material delivered, this process may provide partial or complete loss of function for the
target gene. A reduction or loss of gene expression in at least 50%, 60%, 70%, 80%,
90%, 95% or 99% or more of targeted cells is exemplary. Inhibition of gene expression
refers to the absence (or observable decrease) in the level of protein and/or mRNA
product from a target gene. Specificity refers to the ability to inhibit the target gene
without manifest effects on other genes of the cell. The consequences of inhibition can
be confirmed by examination of the outward properties of the cell or organism (as
presented below in the examples) or by biochemical techniques such as RNA solution
hybridization, nuclease protection, Northern hybridization, reverse transcription, gene
expression monitoring with a microarray, antibody binding, enzyme linked
immunosorbent assay (ELISA), Western blotting, radioimmunoassay (RIA), other
immunoassays, and fluorescence activated cell analysis (FACS).

For RNA-mediated inhibition in a cell line or whole organism, gene expression is
conveniently assayed by use of a reporter or drug resistance gene whose protein product
is easily assayed. Such reporter genes include acetohydroxyacid synthase (AHAS),
alkaline phosphatase (AP), beta galactosidase (LacZ), beta glucoronidase (GUS),
chloramphenicol acetyltransferase (CAT), green fluorescent protein (GFP), horseradish
peroxidase (HRP), luciferase (Luc), nopaline synthase (NOS), octopine synthase (OCS),
and derivatives thereof. Multiple selectable markers are available that confer resistance
to ampicillin, bleomycin, chloramphenicol, gentamycin, hygromycin, kanamycin,
lincomycin, methotrexate, phosphinothricin, puromycin, and tetracyclin. Depending on
the assay, quantitation of the amount of gene expression allows one to determine a
degree of inhibition which is greater than 10%, 33%, 50%, 90%, 95% or 99% as
compared to a cell not treated according to the present invention. Lower doses of
injected material and longer times after administration of an RNAi agent may result in
inhibition in a smaller fraction of cells (e.g., at least 10%, 20%, 50%, 75%, 90%, or 95%
of targeted cells). Quantitation of gene expression in a cell may show similar amounts
of inhibition at the level of accumulation of target mRNA or translation of target protein.
As an example, the efficiency of inhibition may be determined by assessing the amount
of gene product in the cell; mRNA may be detected with a hybridization probe having a
nucleotide sequence outside the region used for the inhibitory double-stranded RNA, or
translated polypeptide may be detected with an antibody raised against the polypeptide
sequence of that region.
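
The degree of inhibition discussed above can be expressed with simple arithmetic on a reporter readout. The following Python sketch is illustrative only: the function names are hypothetical, and it assumes a background-corrected reporter signal that scales linearly with gene expression, which must be verified for any particular assay.

```python
def percent_inhibition(treated: float, control: float) -> float:
    """Percent reduction of a reporter signal relative to an untreated control.

    Assumes both values are background-corrected and that the signal is
    proportional to expression of the target gene.
    """
    if control <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (1.0 - treated / control)


def thresholds_exceeded(inhibition: float,
                        thresholds=(10, 33, 50, 90, 95, 99)) -> list[int]:
    """Return the exemplary thresholds (in percent) that `inhibition` exceeds."""
    return [t for t in thresholds if inhibition > t]
```

For instance, a treated signal of 25 units against a control of 100 units corresponds to 75% inhibition, which exceeds the 10%, 33%, and 50% thresholds listed above.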

The RNA may be introduced in an amount which allows delivery of at least one
copy per cell. Higher doses (e.g., at least 5, 10, 100, 500 or 1000 copies per cell) of
material may yield more effective inhibition; lower doses may also be useful for specific
applications.
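
To relate a copies-per-cell dose to a molar quantity, the conversion below may be used as a rough guide. It is a sketch under the simplifying assumption that every molecule is delivered into a cell (actual delivery efficiency is typically far lower), and the function names are introduced here for illustration.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def moles_required(copies_per_cell: float, n_cells: float) -> float:
    """Moles of RNA needed to supply `copies_per_cell` to each of `n_cells` cells.

    Assumes 100% delivery efficiency, which is an idealization.
    """
    if copies_per_cell < 0 or n_cells < 0:
        raise ValueError("inputs must be non-negative")
    return copies_per_cell * n_cells / AVOGADRO


def copies_from_moles(moles: float, n_cells: float) -> float:
    """Average copies per cell when `moles` of RNA are shared among `n_cells` cells."""
    if n_cells <= 0:
        raise ValueError("n_cells must be positive")
    return moles * AVOGADRO / n_cells
```

Under this idealization, 1000 copies per cell for one million cells corresponds to roughly 1.7 femtomoles of RNA.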

IV. Methods of Treatment: 

The present invention provides for both prophylactic and therapeutic methods of
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated
with aberrant or unwanted target gene expression or activity. "Treatment", or "treating"
as used herein, is defined as the application or administration of a therapeutic agent (e.g.,
a RNAi agent or vector or transgene encoding same) to a patient, or application or
administration of a therapeutic agent to an isolated tissue or cell line from a patient, who
has a disease or disorder, a symptom of disease or disorder or a predisposition toward a
disease or disorder, with the purpose to cure, heal, alleviate, relieve, alter, remedy,
ameliorate, improve or affect the disease or disorder, the symptoms of the disease or
disorder, or the predisposition toward disease.

With regard to both prophylactic and therapeutic methods of treatment, such 
treatments may be specifically tailored or modified, based on knowledge obtained from 
the field of pharmacogenomics. "Pharmacogenomics", as used herein, refers to the 
application of genomics technologies such as gene sequencing, statistical genetics, and 
gene expression analysis to drugs in clinical development and on the market. More 
specifically, the term refers to the study of how a patient's genes determine his or her 
response to a drug (e.g., a patient's "drug response phenotype", or "drug response 
genotype"). Thus, another aspect of the invention provides methods for tailoring an 
individual's prophylactic or therapeutic treatment with either the target gene molecules 
of the present invention or target gene modulators according to that individual's drug 
response genotype. Pharmacogenomics allows a clinician or physician to target 
prophylactic or therapeutic treatments to patients who will most benefit from the 
treatment and to avoid treatment of patients who will experience toxic drug-related side 
effects. 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a 
disease or condition associated with an aberrant or unwanted target gene expression or 
activity, by administering to the subject a therapeutic agent (e.g., an RNAi agent or vector 
or transgene encoding same). Subjects at risk for a disease which is caused or 
contributed to by aberrant or unwanted target gene expression or activity can be 
identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the 
manifestation of symptoms characteristic of the target gene aberrancy, such that a 
disease or disorder is prevented or, alternatively, delayed in its progression. Depending 
on the type of target gene aberrancy, for example, a target gene, target gene agonist or 
target gene antagonist agent can be used for treating the subject. The appropriate agent 
can be determined based on screening assays described herein. 

2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating target gene 
expression, protein expression or activity for therapeutic purposes. Accordingly, in an 
exemplary embodiment, the modulatory method of the invention involves contacting a 
cell capable of expressing target gene with a therapeutic agent (e.g., an RNAi agent or 
vector or transgene encoding same) that is specific for the target gene or protein (e.g., is 
specific for the mRNA encoded by said gene or specifying the amino acid sequence of 
said protein) such that expression or one or more of the activities of target protein is 
modulated. These modulatory methods can be performed in vitro (e.g., by culturing the 
cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a 
subject). As such, the present invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant or unwanted expression or 
activity of a target gene polypeptide or nucleic acid molecule. Inhibition of target gene 
activity is desirable in situations in which target gene is abnormally unregulated and/or 
in which decreased target gene activity is likely to have a beneficial effect. 

3. Pharmacogenomics 

The therapeutic agents (e.g., an RNAi agent or vector or transgene encoding same) 
of the invention can be administered to individuals to treat (prophylactically or 
therapeutically) disorders associated with aberrant or unwanted target gene activity. In 
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 
determining whether to administer a therapeutic agent as well as tailoring the dosage 
and/or therapeutic regimen of treatment with a therapeutic agent. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 
persons. See, for example, Eichelbaum, M. et al. (1996) Clin. Exp. Pharmacol. Physiol. 
23(10-11):983-985 and Linder, M.W. et al. (1997) Clin. Chem. 43(2):254-266. In 
general, two types of pharmacogenetic conditions can be differentiated: genetic 
conditions transmitted as a single factor altering the way drugs act on the body (altered 
drug action), or genetic conditions transmitted as single factors altering the way the body 
acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur 
either as rare genetic defects or as naturally-occurring polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is haemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 
fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 
response, known as "a genome-wide association", relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a 
"bi-allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
sites on the human genome, each of which has two variants). Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect. Alternatively, such a 
high-resolution map can be generated from a combination of some ten million known 
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 
DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process; however, the vast majority may not be 
disease-associated. Given a genetic map based on the occurrence of such SNPs, individuals can 
be grouped into genetic categories depending on a particular pattern of SNPs in their 
individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 
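
The grouping of individuals by SNP pattern described above can be illustrated with a minimal sketch. The marker names, patient labels, and the function `group_by_snp_pattern` are hypothetical and chosen only for illustration; they are not taken from the specification, and an actual analysis would involve far more markers and statistical treatment.

```python
from collections import defaultdict


def group_by_snp_pattern(genotypes, markers):
    """Group individuals that carry identical alleles at every listed marker.

    genotypes: dict mapping an individual's label to a dict of marker -> allele.
    markers:   ordered list of marker names that define the pattern.
    """
    groups = defaultdict(list)
    for person, calls in genotypes.items():
        # The tuple of alleles at the chosen markers is the "SNP pattern".
        pattern = tuple(calls[m] for m in markers)
        groups[pattern].append(person)
    return dict(groups)


# Hypothetical genotype calls at two illustrative bi-allelic markers.
genotypes = {
    "patient_1": {"snp_a": "A", "snp_b": "G"},
    "patient_2": {"snp_a": "A", "snp_b": "G"},
    "patient_3": {"snp_a": "T", "snp_b": "G"},
}
groups = group_by_snp_pattern(genotypes, ["snp_a", "snp_b"])
```

Each resulting group collects individuals sharing one pattern, which is the unit to which a treatment regimen could then be tailored.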

Alternatively, a method termed the "candidate gene approach" can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 
encodes a drug's target is known (e.g., a target gene polypeptide of the present 
invention), all common variants of that gene can be fairly easily identified in the 
population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) 
and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed 
metabolite morphine. The other extreme are the so-called ultra-rapid metabolizers who 
do not respond to standard doses. Recently, the molecular basis of ultra-rapid 
metabolism has been identified to be due to CYP2D6 gene amplification. 

Alternatively, a method termed "gene expression profiling" can be utilized to 
identify genes that predict drug response. For example, the gene expression of an 
animal dosed with a therapeutic agent of the present invention can give an indication 
whether gene pathways related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment of an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with a therapeutic 
agent, as described herein. 

Therapeutic agents can be tested in an appropriate animal model. For example, 
an RNAi agent (or expression vector or transgene encoding same) as described herein 
can be used in an animal model to determine the efficacy, toxicity, or side effects of 
treatment with said agent. Alternatively, a therapeutic agent can be used in an animal 
model to determine the mechanism of action of such an agent. For example, an agent 
can be used in an animal model to determine the efficacy, toxicity, or side effects of 
treatment with such an agent. Alternatively, an agent can be used in an animal model to 
determine the mechanism of action of such an agent. 

V. Pharmaceutical Compositions 

The invention pertains to uses of the above-described agents for therapeutic 
treatments as described infra. Accordingly, the modulators of the present invention can 
be incorporated into pharmaceutical compositions suitable for administration. Such 
compositions typically comprise the nucleic acid molecule, protein, antibody, or 
modulatory compound and a pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with pharmaceutical administration. 
The use of such media and agents for pharmaceutically active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible with the 
active compound, use thereof in the compositions is contemplated. Supplementary 
active compounds can also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, intraperitoneal, intramuscular, 
oral (e.g., inhalation), transdermal (topical), and transmucosal administration. Solutions 
or suspensions used for parenteral, intradermal, or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 
fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates; and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous 
administration, suitable carriers include physiological saline, bacteriostatic water, 
Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In all 
cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can be 
brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound in the required amount in an appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by filtered sterilization. Generally, 
dispersions are prepared by incorporating the active compound into a sterile vehicle 
which contains a basic dispersion medium and the required other ingredients from those 
enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuum drying and freeze-drying 
which yields a powder of the active ingredient plus any additional desired ingredient 
from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose; a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention is 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
LD50/ED50. Compounds that exhibit large therapeutic indices are preferred. Although 
compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 
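
The therapeutic index defined above as the ratio LD50/ED50 can be computed as in the following minimal sketch. The function name `therapeutic_index` and the numerical values are hypothetical and serve only to illustrate the arithmetic; both doses are assumed to be expressed in the same units.

```python
def therapeutic_index(ld50, ed50):
    """Return the therapeutic index, i.e. the ratio LD50/ED50.

    ld50: dose lethal to 50% of the population.
    ed50: dose therapeutically effective in 50% of the population.
    Both values must be positive and in the same units.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses (e.g., mg/kg) for two illustrative compounds.
ti_compound_a = therapeutic_index(ld50=500.0, ed50=25.0)
ti_compound_b = therapeutic_index(ld50=60.0, ed50=30.0)
```

Under these assumed values the first compound has the larger index and would, by the criterion stated above, be the preferred one.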

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the EC50 (i.e., the concentration of 
the test compound which achieves a half-maximal response) as determined in cell 
culture. Such information can be used to more accurately determine useful doses in 
humans. Levels in plasma may be measured, for example, by high performance liquid 
chromatography. 
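
As an illustration of how an EC50 might be estimated from cell culture data, the following minimal sketch interpolates on a logarithmic concentration scale between the two measured points that bracket the half-maximal response. It assumes a zero baseline, a response that increases with concentration, and that the highest measured response is the maximum; the function name `estimate_ec50` and the data are hypothetical. In practice a full dose-response curve fit would typically be used instead.

```python
import math


def estimate_ec50(concentrations, responses):
    """Estimate the concentration giving a half-maximal response.

    Uses linear interpolation of response against log10(concentration)
    between the two adjacent points that bracket half of the maximum
    observed response. Concentrations must be positive.
    """
    pairs = sorted(zip(concentrations, responses))
    half = max(r for _, r in pairs) / 2.0
    for (c_lo, r_lo), (c_hi, r_hi) in zip(pairs, pairs[1:]):
        if r_lo <= half <= r_hi and r_hi != r_lo:
            frac = (half - r_lo) / (r_hi - r_lo)
            log_lo, log_hi = math.log10(c_lo), math.log10(c_hi)
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    raise ValueError("response never crosses the half-maximal level")


# Hypothetical measurements: concentration (nM) and response (% of maximum).
ec50 = estimate_ec50([1, 10, 100, 1000], [10, 30, 70, 100])
```

With these assumed data the half-maximal response (50%) lies midway, on the log scale, between the 10 nM and 100 nM points.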

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

VI. Knockout and/or Knockdown Cells or Organisms 

A further preferred use for the RNAi agents of the present invention (or vectors 
or transgenes encoding same) is a functional analysis to be carried out in eukaryotic 
cells, or eukaryotic non-human organisms, preferably mammalian cells or organisms and 
most preferably human cells, e.g., cell lines such as HeLa or 293, or rodents, e.g., rats and 
mice. By administering a suitable RNAi agent which is sufficiently complementary to a 
target mRNA sequence to direct target-specific RNA interference, a specific knockout or 
knockdown phenotype can be obtained in a target cell, e.g., in cell culture or in a target 
organism. 

Thus, a further subject matter of the invention is a eukaryotic cell or a eukaryotic 
non-human organism exhibiting a target gene-specific knockout or knockdown 
phenotype comprising a fully or at least partially deficient expression of at least one 
endogenous target gene wherein said cell or organism is transfected with at least one 
vector comprising DNA encoding an RNAi agent capable of inhibiting the expression of 
the target gene. It should be noted that the present invention allows a target-specific 
knockout or knockdown of several different endogenous genes due to the specificity of 
the RNAi agent. 

Gene-specific knockout or knockdown phenotypes of cells or non-human 
organisms, particularly of human cells or non-human mammals, may be used in analytic 
procedures, e.g., in the functional and/or phenotypical analysis of complex 
physiological processes such as analysis of gene expression profiles and/or proteomes. 
Preferably the analysis is carried out by high throughput methods using oligonucleotide 
based chips. 

Using RNAi based knockout or knockdown technologies, the expression of an 
endogenous target gene may be inhibited in a target cell or a target organism. The 
endogenous gene may be complemented by an exogenous target nucleic acid coding for 
the target protein or a variant or mutated form of the target protein, e.g., a gene or a 
DNA, which may optionally be fused to a further nucleic acid sequence encoding a 
detectable peptide or polypeptide, e.g., an affinity tag, particularly a multiple affinity tag. 

Variants or mutated forms of the target gene differ from the endogenous target 
gene in that they encode a gene product which differs from the endogenous gene 
product on the amino acid level by substitutions, insertions and/or deletions of single or 
multiple amino acids. The variants or mutated forms may have the same biological 
activity as the endogenous target gene. On the other hand, the variant or mutated target 
gene may also have a biological activity which differs from the biological activity of the 
endogenous target gene, e.g., a partially deleted activity, a completely deleted activity, 
an enhanced activity etc. The complementation may be accomplished by coexpressing 
the polypeptide encoded by the endogenous nucleic acid, e.g., a fusion protein 
comprising the target protein and the affinity tag, and the double stranded RNA molecule 
for knocking out the endogenous gene in the target cell. This coexpression may be 
accomplished by using a suitable expression vector expressing both the polypeptide 
encoded by the endogenous nucleic acid, e.g., the tag-modified target protein, and the 
double stranded RNA molecule or alternatively by using a combination of expression 
vectors. Proteins and protein complexes which are synthesized de novo in the target cell 
will contain the exogenous gene product, e.g., the modified fusion protein. In order to 
avoid suppression of the exogenous gene product by the RNAi agent, the nucleotide 
sequence encoding the exogenous nucleic acid may be altered at the DNA level (with or 
without causing mutations on the amino acid level) in the part of the sequence which is 
homologous to the RNAi agent. Alternatively, the endogenous target gene may be 
complemented by corresponding nucleotide sequences from other species, e.g., from 
mouse. 
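
The alteration of the exogenous coding sequence at the DNA level without mutations on the amino acid level can be illustrated with a minimal sketch that swaps codons in the region homologous to the RNAi agent for synonymous codons. The function names, the toy coding sequence, and the region coordinates are hypothetical; the sketch assumes an in-frame coding sequence and the standard genetic code, and it ignores practical considerations such as codon usage bias.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode_silently(cds, start, end):
    """Replace every codon overlapping cds[start:end] with a synonymous codon.

    For each affected codon, the synonym differing at the most base positions
    is chosen; codons without a synonym (e.g., ATG, TGG) are left unchanged.
    """
    out = []
    for i in range(0, len(cds), 3):
        original = cds[i:i + 3]
        codon = original
        if i < end and i + 3 > start:
            synonyms = [c for c, aa in CODON_TABLE.items()
                        if aa == CODON_TABLE[original] and c != original]
            if synonyms:
                codon = max(synonyms,
                            key=lambda c: sum(a != b for a, b in zip(c, original)))
        out.append(codon)
    return "".join(out)


# Hypothetical coding sequence; positions 3..15 stand in for the RNAi target site.
cds = "ATGGCTAAAGGTCTGTAA"
rescue = recode_silently(cds, 3, 15)
mismatches = sum(a != b for a, b in zip(cds, rescue))
```

The recoded sequence encodes the same protein while carrying mismatches to the RNAi agent within the targeted region, which is the property relied on to spare the exogenous gene product.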

VII. Transgenic Organisms 

Engineered RNA precursors of the invention can be expressed in transgenic 
animals. These animals represent a model system for the study of disorders that are 
caused by, or exacerbated by, overexpression or underexpression (as compared to 
wildtype or normal) of nucleic acids (and their encoded polypeptides) targeted for 
destruction by the RNAi agents, e.g., siRNAs and shRNAs, and for the development of 
therapeutic agents that modulate the expression or activity of nucleic acids or 
polypeptides targeted for destruction. 

Transgenic animals can be farm animals (pigs, goats, sheep, cows, horses, 
rabbits, and the like), rodents (such as rats, guinea pigs, and mice), non-human primates 
(for example, baboons, monkeys, and chimpanzees), and domestic animals (for example, 
dogs and cats). Invertebrates such as Caenorhabditis elegans or Drosophila can be used 
as well as non-mammalian vertebrates such as fish (e.g., zebrafish) or birds (e.g., 
chickens). 

Engineered RNA precursors with stems of 18 to 30 nucleotides in length are 
preferred for use in mammals, such as mice. A transgenic founder animal can be 
identified based upon the presence of a transgene that encodes the new RNA precursors 
in its genome, and/or expression of the transgene in tissues or cells of the animals, for 
example, using PCR or Northern analysis. Expression is confirmed by a decrease in the 
expression (RNA or protein) of the target sequence. 

A transgenic founder animal can be used to breed additional animals carrying the 
transgene. Moreover, transgenic animals carrying a transgene encoding the RNA 
precursors can further be bred to other transgenic animals carrying other transgenes. In 
addition, cells obtained from the transgenic founder animal or its offspring can be 
cultured to establish primary, secondary, or immortal cell lines containing the transgene. 

1. Procedures for Making Transgenic, Non-Human Animals 

A number of methods have been used to obtain transgenic, non-human animals, 
which are animals that have gained an additional gene by the introduction of a transgene 
into their cells (e.g., both the somatic and germ cells), or into an ancestor's germ line. In 
some cases, transgenic animals can be generated by commercial facilities (e.g., The 
Transgenic Drosophila Facility at Michigan State University, The Transgenic Zebrafish 
Core Facility at the Medical College of Georgia (Augusta, Georgia), and Xenogen 
Biosciences (St. Louis, MO)). In general, the construct containing the transgene is 
supplied to the facility for generating a transgenic animal. 

Methods for generating transgenic animals include introducing the transgene into 
the germ line of the animal. One method is by microinjection of a gene construct into 
the pronucleus of an early stage embryo (e.g., before the four-cell stage; Wagner et al., 
1981, Proc. Natl. Acad. Sci. USA 78:5016; Brinster et al., 1985, Proc. Natl. Acad. Sci. 
USA 82:4438). Alternatively, the transgene can be introduced into the pronucleus by 
retroviral infection. A detailed procedure for producing such transgenic mice has been 
described (see e.g., Hogan et al., Manipulating the Mouse Embryo, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1986); U.S. Patent No. 5,175,383 (1992)). This 
procedure has also been adapted for other animal species (e.g., Hammer et al., 1985, 
Nature 315:680; Murray et al., 1989, Reprod. Fert. Devl. 1:147; Pursel et al., 1987, Vet. 
Immunol. Histopath. 17:303; Rexroad et al., 1990, J. Reprod. Fert. 41 (suppl):119; 
Rexroad et al., 1989, Molec. Reprod. Devl. 1:164; Simons et al., 1988, BioTechnology 
6:179; Vize et al., 1988, J. Cell. Sci. 90:295; and Wagner, 1989, J. Cell. Biochem. 13B 
(suppl):164). 

In brief, the procedure involves introducing the transgene into an animal by 
microinjecting the construct into the pronuclei of the fertilized mammalian egg(s) to 
cause one or more copies of the transgene to be retained in the cells of the developing 
mammal(s). Following introduction of the transgene construct into the fertilized egg, 
the egg may be incubated in vitro for varying amounts of time, or reimplanted in a 
surrogate host, or both. One common method is to incubate the embryos in vitro for 
about 1-7 days, depending on the species, and then reimplant them into the surrogate 
host. The presence of the transgene in the progeny of the transgenically manipulated 
embryos can be tested by Southern blot analysis of a segment of tissue. 

Another method for producing germ-line transgenic animals is through the use of 
embryonic stem (ES) cells. The gene construct can be introduced into embryonic stem 
cells by homologous recombination (Thomas et al., 1987, Cell 51:503; Capecchi, 
1989, Science 244:1288; Joyner et al., 1989, Nature 338:153) in a transcriptionally 
active region of the genome. A suitable construct can also be introduced into embryonic 
stem cells by DNA-mediated transfection, such as by electroporation (Ausubel et al., 
Current Protocols in Molecular Biology, John Wiley & Sons, 1987). Detailed 
procedures for culturing embryonic stem cells (e.g., ES-D3, ATCC# CCL-1934, 
ES-E14TG2a, ATCC# CCL-1821, American Type Culture Collection, Rockville, MD) and 
methods of making transgenic animals from embryonic stem cells can be found in 
Teratocarcinomas and Embryonic Stem Cells, A Practical Approach, ed. E. J. Robertson 
(IRL Press, 1987). In brief, the ES cells are obtained from pre-implantation embryos 
cultured in vitro (Evans et al., 1981, Nature 292:154-156). Transgenes can be efficiently 
introduced into ES cells by DNA transfection or by retrovirus-mediated transduction. 
The resulting transformed ES cells can thereafter be combined with blastocysts from a 
non-human animal. The ES cells colonize the embryo and contribute to the germ line of 
the resulting chimeric animal. 

In the above methods, the transgene can be introduced as a linear construct, a 
circular plasmid, or a viral vector, which can be incorporated and inherited as a 
transgene integrated into the host genome. The transgene can also be constructed to 
permit it to be inherited as an extrachromosomal plasmid (Gassmann et al., 1995, Proc. 
Natl. Acad. Sci. USA 92:1292). A plasmid is a DNA molecule that can replicate 
autonomously in a host. 

The transgenic, non-human animals can also be obtained by infecting or 
transfecting cells either in vivo (e.g., direct injection), ex vivo (e.g., infecting the cells 
outside the host and later reimplanting), or in vitro (e.g., infecting the cells outside the host), 
for example, with a recombinant viral vector carrying a gene encoding the engineered 
RNA precursors. Examples of suitable viral vectors include recombinant retroviral 
vectors (Valerio et al., 1989, Gene 84:419; Scharfman et al., 1991, Proc. Natl. Acad. Sci. 
USA 88:462; Miller and Buttimore, 1986, Mol. Cell. Biol. 6:2895), recombinant 
adenoviral vectors (Freidman et al., 1986, Mol. Cell. Biol. 6:3791; Levrero et al., 1991, 
Gene 101:195), and recombinant Herpes simplex viral vectors (Fink et al., 1992, 
Human Gene Therapy 3:11). Such methods are also useful for introducing constructs 
into cells for uses other than generation of transgenic animals. 

Other approaches include insertion of transgenes encoding the new engineered 
RNA precursors into viral vectors including recombinant adenovirus, adeno-associated 
virus, and herpes simplex virus-1, or recombinant bacterial or eukaryotic plasmids. Viral 
vectors transfect cells directly. Other approaches include delivering the transgenes, in 
the form of plasmid DNA, with the help of, for example, cationic liposomes (lipofectin) 
or derivatized (e.g., antibody conjugated) polylysine conjugates, gramicidin S, artificial 
viral envelopes, or other such intracellular carriers, as well as direct injection of the 
transgene construct or CaPO4 precipitation carried out in vivo. Such methods can also 
be used in vitro to introduce constructs into cells for uses other than generation of 
transgenic animals. 

Retrovirus vectors and adeno-associated virus vectors can be used as a
recombinant gene delivery system for the transfer of exogenous genes in vivo or in vitro.
These vectors provide efficient delivery of genes into cells, and the transferred nucleic
acids are stably integrated into the chromosomal DNA of the host. The development of
specialized cell lines (termed "packaging cells") which produce only
replication-defective retroviruses has increased the utility of retroviruses for gene therapy,
and defective retroviruses are characterized for use in gene transfer for gene therapy
purposes (for a review see Miller, 1990, Blood 76:271). A replication-defective
retrovirus can be packaged into virions which can be used to infect a target cell through
the use of a helper virus by standard techniques. Protocols for producing recombinant
retroviruses and for infecting cells in vitro or in vivo with such viruses can be found in
Current Protocols in Molecular Biology, Ausubel, F.M. et al. (eds.), Greene Publishing
Associates (1989), Sections 9.10-9.14, and other standard laboratory manuals.

Examples of suitable retroviruses include pLJ, pZIP, pWE and pEM, which are
known to those skilled in the art. Examples of suitable packaging virus lines for
preparing both ecotropic and amphotropic retroviral systems include Psi-Crip, Psi-Cre,
Psi-2 and Psi-Am. Retroviruses have been used to introduce a variety of genes into
many different cell types, including epithelial cells, in vitro and/or in vivo (see for
example Eglitis et al., 1985, Science 230:1395-1398; Danos and Mulligan, 1988, Proc.
Natl. Acad. Sci. USA 85:6460-6464; Wilson et al., 1988, Proc. Natl. Acad. Sci. USA
85:3014-3018; Armentano et al., 1990, Proc. Natl. Acad. Sci. USA 87:6141-6145; Huber
et al., 1991, Proc. Natl. Acad. Sci. USA 88:8039-8043; Ferry et al., 1991, Proc. Natl.
Acad. Sci. USA 88:8377-8381; Chowdhury et al., 1991, Science 254:1802-1805; van
Beusechem et al., 1992, Proc. Natl. Acad. Sci. USA 89:7640-7644; Kay et al., 1992,
Human Gene Therapy 3:641-647; Dai et al., 1992, Proc. Natl. Acad. Sci. USA
89:10892-10895; Hwu et al., 1993, J. Immunol. 150:4104-4115; U.S. Patent No.
4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 89/07136; PCT Application
WO 89/02468; PCT Application WO 89/05345; and PCT Application WO 92/07573).

In another example, recombinant retroviral vectors capable of transducing and
expressing genes inserted into the genome of a cell can be produced by transfecting the
recombinant retroviral genome into suitable packaging cell lines such as PA317 and
Psi-CRIP (Cornette et al., 1991, Human Gene Therapy 2:5-10; Cone et al., 1984, Proc. Natl.
Acad. Sci. USA 81:6349). Recombinant adenoviral vectors can be used to infect a wide
variety of cells and tissues in susceptible hosts (e.g., rat, hamster, dog, and chimpanzee)
(Hsu et al., 1992, J. Infectious Disease, 166:769), and also have the advantage of not
requiring mitotically active cells for infection. Another viral gene delivery system
useful in the present invention also utilizes adenovirus-derived vectors. The genome of
an adenovirus can be manipulated such that it encodes and expresses a gene product of
interest but is inactivated in terms of its ability to replicate in a normal lytic viral life
cycle. See, for example, Berkner et al. (1988, BioTechniques 6:616), Rosenfeld et al.
(1991, Science 252:431-434), and Rosenfeld et al. (1992, Cell 68:143-155). Suitable
adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other strains
of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are known to those skilled in the art.
Recombinant adenoviruses can be advantageous in certain circumstances in that they are
capable of infecting nondividing cells and can be used to infect a wide variety of cell
types, including epithelial cells (Rosenfeld et al., 1992, cited supra). Furthermore, the
virus particle is relatively stable and amenable to purification and concentration, and as
above, can be modified to affect the spectrum of infectivity. Additionally, introduced
adenoviral DNA (and foreign DNA contained therein) is not integrated into the genome
of a host cell but remains episomal, thereby avoiding potential problems that can occur
as a result of insertional mutagenesis in situ where introduced DNA becomes integrated
into the host genome (e.g., retroviral DNA). Moreover, the carrying capacity of the
adenoviral genome for foreign DNA is large (up to 8 kilobases) relative to other gene
delivery vectors (Berkner et al., cited supra; Haj-Ahmad and Graham, 1986, J. Virol.
57:267).

Yet another viral vector system useful for delivery of the subject transgenes is
the adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring
defective virus that requires another virus, such as an adenovirus or a herpes virus, as a
helper virus for efficient replication and a productive life cycle. For a review, see
Muzyczka et al. (1992, Curr. Topics in Micro. and Immunol. 158:97-129). It is also one
of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a high
frequency of stable integration (see for example Flotte et al., 1992, Am. J. Respir. Cell.
Mol. Biol. 7:349-356; Samulski et al., 1989, J. Virol. 63:3822-3828; and McLaughlin et
al., 1989, J. Virol. 62:1963-1973). Vectors containing as little as 300 base pairs of AAV
can be packaged and can integrate. Space for exogenous DNA is limited to about 4.5
kb. An AAV vector such as that described in Tratschin et al. (1985) Mol. Cell. Biol.
5:3251-3260 can be used to introduce DNA into cells. A variety of nucleic acids have
been introduced into different cell types using AAV vectors (see for example Hermonat
et al. (1984) Proc. Natl. Acad. Sci. USA 81:6466-6470; Tratschin et al. (1985) Mol. Cell.
Biol. 4:2072-2081; Wondisford et al. (1988) Mol. Endocrinol. 2:32-39; Tratschin et al.
(1984) J. Virol. 51:611-619; and Flotte et al. (1993) J. Biol. Chem. 268:3781-3790).

In addition to viral transfer methods, such as those illustrated above, non-viral
methods can also be employed to cause expression of an shRNA or engineered RNA
precursor of the invention in the tissue of an animal. Most non-viral methods of gene
transfer rely on normal mechanisms used by mammalian cells for the uptake and
intracellular transport of macromolecules. In preferred embodiments, non-viral gene
delivery systems of the present invention rely on endocytic pathways for the uptake of
the subject gene of the invention by the targeted cell. Exemplary gene delivery systems
of this type include liposomal derived systems, poly-lysine conjugates, and artificial
viral envelopes. Other embodiments include plasmid injection systems such as are
described in Meuli et al., (2001) J. Invest. Dermatol., 116(1):131-135; Cohen et al.,
(2000) Gene Ther., 7(22):1896-905; and Tam et al., (2000) Gene Ther., 7(21):1867-74.

In a representative embodiment, a gene encoding an shRNA or engineered RNA
precursor of the invention can be entrapped in liposomes bearing positive charges on
their surface (e.g., lipofectins) and (optionally) which are tagged with antibodies against
cell surface antigens of the target tissue (Mizuno et al., (1992) No Shinkei Geka,
20:547-551; PCT publication WO91/06309; Japanese patent application 1047381; and
European patent publication EP-A-43075).

Animals harboring the transgene can be identified by detecting the presence of
the transgene in genomic DNA (e.g., using Southern analysis). In addition, expression
of the shRNA or engineered RNA precursor can be detected directly (e.g., by Northern
analysis). Expression of the transgene can also be confirmed by detecting a decrease in
the amount of protein corresponding to the targeted sequence. When the transgene is
under the control of an inducible or developmentally regulated promoter, expression of
the target protein is decreased when the transgene is induced or at the developmental
stage when the transgene is expressed, respectively.

2. Clones of Transgenic Animals

Clones of the non-human transgenic animals described herein can be produced
according to the methods described in Wilmut et al. ((1997) Nature, 385:810-813) and
PCT publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a somatic
cell from the transgenic animal, can be isolated and induced to exit the growth cycle and
enter the G0 phase to become quiescent. The quiescent cell can then be fused, e.g.,
through the use of electrical pulses, to an enucleated oocyte from an animal of the same
species from which the quiescent cell is isolated. The reconstructed oocyte is then
cultured such that it develops into a morula or blastocyst and is then transferred to a
pseudopregnant female foster animal. Offspring born of this female foster animal will
be clones of the animal from which the cell, e.g., the somatic cell, was isolated.

Once the transgenic animal is produced, cells of the transgenic animal and cells
from a control animal are screened to determine the presence of an RNA precursor
nucleic acid sequence, e.g., using polymerase chain reaction (PCR). Alternatively, the
cells can be screened to determine if the RNA precursor is expressed (e.g., by standard
procedures such as Northern blot analysis or reverse transcriptase-polymerase chain
reaction (RT-PCR); Sambrook et al., Molecular Cloning - A Laboratory Manual, (Cold
Spring Harbor Laboratory, 1989)).

The transgenic animals of the present invention can be homozygous or
heterozygous, and one of the benefits of the invention is that the target mRNA is
effectively degraded even in heterozygotes. The present invention provides for
transgenic animals that carry a transgene of the invention in all their cells, as well as
animals that carry a transgene in some, but not all of their cells. That is, the invention
provides for mosaic animals. The transgene can be integrated as a single transgene or in
concatamers, e.g., head-to-head tandems or head-to-tail tandems.

For a review of techniques that can be used to generate and assess transgenic
animals, skilled artisans can consult Gordon (Intl. Rev. Cytol. 115:171-229, 1989), and
may obtain additional guidance from, for example: Hogan et al., "Manipulating the
Mouse Embryo" (Cold Spring Harbor Press, Cold Spring Harbor, NY, 1986);
Krimpenfort et al., Bio/Technology 9:86, 1991; Palmiter et al., Cell 41:343, 1985;
Kraemer et al., "Genetic Manipulation of the Early Mammalian Embryo," Cold Spring
Harbor Press, Cold Spring Harbor, NY, 1985; Hammer et al., Nature 315:680, 1985;
Purcel et al., Science 244:1281, 1986; Wagner et al., U.S. Patent No. 5,175,385; and
Krimpenfort et al., U.S. Patent No. 5,175,384.

3. Transgenic Plants 

Among the eukaryotic organisms featured in the invention are plants containing
an exogenous nucleic acid that encodes an engineered RNA precursor of the invention.

Accordingly, a method according to the invention comprises making a plant
having a nucleic acid molecule or construct, e.g., a transgene, described herein.
Techniques for introducing exogenous nucleic acids into monocotyledonous and
dicotyledonous plants are known in the art, and include, without limitation,
Agrobacterium-mediated transformation, viral vector-mediated transformation,
electroporation and particle gun transformation; see, e.g., U.S. Patents Nos. 5,204,253
and 6,013,863. If a cell or tissue culture is used as the recipient tissue for transformation,
plants can be regenerated from transformed cultures by techniques known to those
skilled in the art. Transgenic plants can be entered into a breeding program, e.g., to
introduce a nucleic acid encoding a polypeptide into other lines, to transfer the nucleic
acid to other species or for further selection of other desirable traits. Alternatively,
transgenic plants can be propagated vegetatively for those species amenable to such
techniques. Progeny includes descendants of a particular plant or plant line. Progeny of
a plant include seeds formed on F1, F2, F3, and subsequent generation plants, or seeds
formed on BC1, BC2, BC3, and subsequent generation plants. Seeds produced by a
transgenic plant can be grown and then selfed (or outcrossed and selfed) to obtain seeds
homozygous for the nucleic acid encoding a novel polypeptide.

A suitable group of plants with which to practice the invention includes dicots,
such as safflower, alfalfa, soybean, rapeseed (high erucic acid and canola), or sunflower.
Also suitable are monocots such as corn, wheat, rye, barley, oat, rice, millet, amaranth or
sorghum. Also suitable are vegetable crops or root crops such as potato, broccoli, peas,
sweet corn, popcorn, tomato, beans (including kidney beans, lima beans, dry beans,
green beans) and the like. Also suitable are fruit crops such as peach, pear, apple,
cherry, orange, lemon, grapefruit, plum, mango and palm. Thus, the invention has use
over a broad range of plants, including species from the genera Anacardium, Arachis,
Asparagus, Atropa, Avena, Brassica, Citrus, Citrullus, Capsicum, Carthamus, Cocos,
Coffea, Cucumis, Cucurbita, Daucus, Elaeis, Fragaria, Glycine, Gossypium, Helianthus,
Heterocallis, Hordeum, Hyoscyamus, Lactuca, Linum, Lolium, Lupinus, Lycopersicon,
Malus, Manihot, Majorana, Medicago, Nicotiana, Olea, Oryza, Panicum, Pannesetum,
Persea, Phaseolus, Pistachia, Pisum, Pyrus, Prunus, Raphanus, Ricinus, Secale, Senecio,
Sinapis, Solanum, Sorghum, Theobromus, Trigonella, Triticum, Vicia, Vitis, Vigna and
Zea.

The skilled artisan will appreciate that the enumerated organisms are also useful
for practicing other aspects of the invention, e.g., as host cells, as described supra.

The nucleic acid molecules of the invention can be expressed in plants in a
cell- or tissue-specific manner according to the regulatory elements chosen to include in a
particular nucleic acid construct present in the plant. Suitable cells, tissues, and organs
in which to express a chimeric polypeptide of the invention include, without limitation,
egg cell, central cell, synergid cell, zygote, ovule primordia, nucellus, integuments,
endothelium, female gametophyte cells, embryo, axis, cotyledons, suspensor,
endosperm, seed coat, ground meristem, vascular bundle, cambium, phloem, cortex,
shoot or root apical meristems, lateral shoot or root meristems, floral meristem, leaf
primordia, leaf mesophyll cells, and leaf epidermal cells, e.g., epidermal cells involved
in forming the cuticular layer. Also suitable are cells and tissues grown in liquid media
or on semi-solid media.

4. Transgenic Fungi

Other eukaryotic organisms featured in the invention are fungi containing an
exogenous nucleic acid molecule that encodes an engineered RNA precursor of the
invention. Accordingly, a method according to the invention comprises introducing a
nucleic acid molecule or construct as described herein into a fungus. Techniques for
introducing exogenous nucleic acids into many fungi are known in the art; see, e.g., U.S.
Patents Nos. 5,252,726 and 5,070,020. Transformed fungi can be cultured by techniques
known to those skilled in the art. Such fungi can be used to introduce a nucleic acid
encoding a polypeptide into other fungal strains, to transfer the nucleic acid to other
species or for further selection of other desirable traits.

A suitable group of fungi with which to practice the invention includes fission
yeast and budding yeast, such as Saccharomyces cerevisiae, S. pombe, S. carlsbergensis
and Candida albicans. Filamentous fungi such as Aspergillus spp. and Penicillium spp.
are also useful.

VIII. Functional Genomics and/or Proteomics

A preferred application for the cell or organism of the invention is the analysis of
gene expression profiles and/or proteomes. In an especially preferred embodiment an
analysis of a variant or mutant form of one or several target proteins is carried out,
wherein said variant or mutant forms are reintroduced into the cell or organism by an
exogenous target nucleic acid as described above. The combination of knockout of an
endogenous gene and rescue by using a mutated, e.g., partially deleted, exogenous target
has advantages compared to the use of a knockout cell. Further, this method is
particularly suitable for identifying functional domains of the targeted protein. In a
further preferred embodiment a comparison, e.g., of gene expression profiles and/or
proteomes and/or phenotypic characteristics of at least two cells or organisms is carried
out. These organisms are selected from: (i) a control cell or control organism without
target gene inhibition, (ii) a cell or organism with target gene inhibition and (iii) a cell or
organism with target gene inhibition plus target gene complementation by an exogenous
target nucleic acid.

Furthermore, the RNA knockout complementation method may be used for
preparative purposes, e.g., for the affinity purification of proteins or protein complexes
from eukaryotic cells, particularly mammalian cells and more particularly human cells.
In this embodiment of the invention, the exogenous target nucleic acid preferably codes
for a target protein which is fused to an affinity tag. This method is suitable for
functional proteome analysis in mammalian cells, particularly human cells.

Another utility of the present invention could be a method of identifying gene
function in an organism comprising the use of an RNAi agent to inhibit the activity of a
target gene of previously unknown function. Instead of the time-consuming and
laborious isolation of mutants by traditional genetic screening, functional genomics
would envision determining the function of uncharacterized genes by employing the
invention to reduce the amount and/or alter the timing of target gene activity. The
invention could be used in determining potential targets for pharmaceutics,
understanding normal and pathological events associated with development, determining
signaling pathways responsible for postnatal development/aging, and the like. The
increasing speed of acquiring nucleotide sequence information from genomic and
expressed gene sources, including total sequences for the yeast, D. melanogaster, and
C. elegans genomes, can be coupled with the invention to determine gene function in an
organism (e.g., nematode). The preference of different organisms to use particular
codons, searching sequence databases for related gene products, correlating the linkage
map of genetic traits with the physical map from which the nucleotide sequences are
derived, and artificial intelligence methods may be used to define putative open reading
frames from the nucleotide sequences acquired in such sequencing projects. A simple
assay would be to inhibit gene expression according to the partial sequence available
from an expressed sequence tag (EST). Functional alterations in growth, development,
metabolism, disease resistance, or other biological processes would be indicative of the
normal role of the EST's gene product.
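
As a rough illustration of what defining a putative open reading frame from an acquired nucleotide sequence can involve, the following Python sketch scans the three forward reading frames of a DNA string for ATG-to-stop stretches. It is a deliberately simple stand-in and not the codon-preference, database, linkage-map, or artificial intelligence methods mentioned above; the function name `find_orfs`, the `min_codons` threshold, and the toy sequence are assumptions made only for this example. A partial EST may lack a start codon, which this sketch would miss.

```python
# Hypothetical helper: naive forward-strand ORF scan (illustrative only).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(dna, min_codons=10):
    """Return (start, end, frame) tuples for ATG...stop stretches.

    start/end are 0-based, end-exclusive indices into `dna`; only the
    three forward frames are scanned, and the stop codon is included.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # first in-frame start codon
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None                   # resume search after the stop
    return orfs

# Toy sequence (made up): 2 nt of leader, ATG, 12 GCT codons, TAA, 2 nt trailer.
toy = "CC" + "ATG" + "GCT" * 12 + "TAA" + "GG"
print(find_orfs(toy))
```

A fuller treatment would also scan the reverse complement and tolerate ORFs that run off either end of a partial sequence.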

The ease with which RNA can be introduced into an intact cell/organism
containing the target gene allows the present invention to be used in high throughput
screening (HTS). Solutions containing RNAi agents that are capable of inhibiting the
different expressed genes can be placed into individual wells positioned on a microtiter
plate as an ordered array, and intact cells/organisms in each well can be assayed for any
changes or modifications in behavior or development due to inhibition of target gene
activity. The amplified RNA can be fed directly to, or injected into, the cell/organism
containing the target gene. Alternatively, the RNAi agent can be produced from a
vector, as described herein. Vectors can be injected into the cell/organism containing
the target gene. The function of the target gene can be assayed from the effects it has on
the cell/organism when gene activity is inhibited. This screening could be amenable to
small subjects that can be processed in large number, for example: arabidopsis, bacteria,
drosophila, fungi, nematodes, viruses, zebrafish, and tissue culture cells derived from
mammals. A nematode or other organism that produces a colorimetric, fluorogenic, or
luminescent signal in response to a regulated promoter (e.g., transfected with a reporter
gene construct) can be assayed in an HTS format.
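
The ordered array described above can be pictured as a mapping from well positions to RNAi agents. The Python sketch below lays targets out row by row on a plate; the 8 x 12 (96-well) geometry, the function name `plate_layout`, and the gene labels are assumptions chosen for illustration and are not specified in the text.

```python
from itertools import product
from string import ascii_uppercase

def plate_layout(targets, rows=8, cols=12):
    """Map well IDs (A1, A2, ... row by row) to target genes, one per well."""
    wells = [f"{ascii_uppercase[r]}{c + 1}"
             for r, c in product(range(rows), range(cols))]
    if len(targets) > len(wells):
        raise ValueError("more targets than wells on the plate")
    # zip stops at the shorter list, so unused wells are simply left out
    return dict(zip(wells, targets))

# Placeholder target names (hypothetical).
layout = plate_layout(["geneA", "geneB", "geneC"])
print(layout)
```

Because the mapping is fixed before the assay, a phenotype scored in a given well can be traced back to the gene whose expression was inhibited there.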

The present invention may be useful in allowing the inhibition of essential genes. Such
genes may be required for cell or organism viability at only particular stages of
development or cellular compartments. The functional equivalent of conditional
mutations may be produced by inhibiting activity of the target gene when or where it is
not required for viability. The invention allows addition of RNAi agent at specific times
of development and locations in the organism without introducing permanent mutations
into the target genome.

IX. Screening Assays

The methods of the invention are also suitable for use in methods to identify
and/or characterize potential pharmacological agents, e.g., identifying new
pharmacological agents from a collection of test substances and/or characterizing
mechanisms of action and/or side effects of known pharmacological agents.

Thus, the present invention also relates to a system for identifying and/or
characterizing pharmacological agents acting on at least one target protein comprising:
(a) a eukaryotic cell or a eukaryotic non-human organism capable of expressing at least
one endogenous target gene coding for said target protein, (b) at least one RNAi
agent molecule capable of inhibiting the expression of said at least one endogenous
target gene, and (c) a test substance or a collection of test substances wherein
pharmacological properties of said test substance or said collection are to be identified
and/or characterized. Further, the system as described above preferably comprises: (d)
at least one exogenous target nucleic acid coding for the target protein or a variant or
mutated form of the target protein wherein said exogenous target nucleic acid differs
from the endogenous target gene on the nucleic acid level such that the expression of
the exogenous target nucleic acid is substantially less inhibited by the RNAi agent than
the expression of the endogenous target gene.

The test compounds of the present invention can be obtained using any of the
numerous approaches in combinatorial library methods known in the art, including:
biological libraries; spatially addressable parallel solid phase or solution phase libraries;
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library
method; and synthetic library methods using affinity chromatography selection. The
biological library approach is limited to peptide libraries, while the other four
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et
al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med.
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem.
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in
Gallop et al. (1994) J. Med. Chem. 37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP
'409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage
(Scott and Smith (1990) Science 249:386-390; Devlin (1990) Science 249:404-406;
Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382; Felici (1991) J. Mol. Biol.
222:301-310; Ladner supra).

In a preferred embodiment, the library is a natural product library, e.g., a library
produced by a bacterial, fungal, or yeast culture. In another preferred embodiment, the
library is a synthetic compound library.

This invention is further illustrated by the following examples, which should not
be construed as limiting. The contents of all references, patents and published patent
applications cited throughout this application are incorporated herein by reference.

EXAMPLES

Example I: Functionally asymmetric siRNA duplexes

To assess quantitatively if the two strands of an siRNA duplex are equally
competent to direct RNAi, the individual rates of sense and anti-sense target cleavage for
an siRNA duplex directed against the firefly luciferase mRNA were examined (Figure
1A). The relevant portions of the sense and anti-sense target RNA sequences are shown
in Figure 1A and the siRNA sequence in Figure 1B. This siRNA duplex effectively
silences firefly luciferase expression in cultured human HeLa cells. Using a Drosophila
embryo-derived in vitro RNAi reaction, a significant difference in the rate of target
cleavage for the two siRNA strands was found; the anti-sense siRNA strand directed
more efficient RNAi against a sense RNA target than the sense siRNA strand for an
anti-sense target (Figure 1B). (Anti-sense siRNA strands and sense target RNAs are always
shown in black, and sense siRNAs and anti-sense targets, in grey). Control experiments
showed that using siRNA duplexes with 5' phosphates did not alter this result (data not
shown), indicating that different rates of phosphorylation for the two strands are not the
cause of the observed asymmetry. Surprisingly, the two strands of the luciferase
siRNA duplex, used individually as 5' phosphorylated single strands, had identical rates
of target cleavage (Figure 1C). RNAi directed by single-stranded siRNA is roughly
10-fold less efficient than that triggered by siRNA duplexes, reflecting the ~100-fold lower
stability of single-stranded siRNAs in vitro and in vivo (Schwarz et al., 2002). The
difference in the rate of cleavage directed by the sense and anti-sense strands when the
reaction was programmed with an siRNA duplex is unlikely to reflect a difference in the
inherent susceptibility of the two targets to RNAi. Instead, the observation that the same
two siRNA strands are equally effective as single-strands, but show dramatically
different activities when paired with each other, indicates that the asymmetry in their
function is established at a step in the RNAi pathway prior to the encounter of the
programmed RISC with its corresponding RNA target.

Example II: Differential RISC assembly accounts for siRNA strand functional
asymmetry

To identify the source of asymmetry in the function of this siRNA duplex, the
unwinding of the two siRNA strands when the duplex was incubated in a standard in
vitro RNAi reaction was measured. This assay was shown previously to determine
accurately the fraction of siRNA that is unwound in an ATP-dependent step in the RNAi
pathway; no functional RISC is assembled in the absence of ATP (Nykanen et al., 2001).
Previous studies show that siRNA unwinding correlates with the capacity of an siRNA to
function in target cleavage (Nykanen et al., 2001; Martinez et al., 2002), demonstrating
that siRNA duplex unwinding is required to assemble a RISC competent to base pair
with its target RNA. Here, the accumulation of single-stranded siRNA from the
luciferase siRNA duplex after 1 hour of incubation in an in vitro RNAi reaction in the
absence of target RNA was measured. After one hour of incubation with Drosophila
embryo lysate in a standard RNAi reaction, 22% of the anti-sense strand of the luciferase
siRNA was converted to single-strand (Figure 1D; 'siRNA B' solid black bar).
Remarkably, a corresponding amount of single-stranded sense siRNA was not detected.
Instead, only 3% of the sense strand accumulated as single-stranded siRNA (Figure 1D;
'siRNA B' solid grey bar). In control experiments, no single-stranded RNA was
detected without incubation in lysate (not shown), demonstrating that the siRNA was
entirely double-stranded at the beginning of the reaction. Since the production of
single-stranded anti-sense siRNA must be accompanied by an equal amount of single-stranded
sense siRNA, the missing sense-strand must have been destroyed after unwinding.
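
The strand accounting behind this conclusion can be made explicit. The short Python sketch below uses only the two percentages reported above (22% and 3%); the function name and the assumption that every released anti-sense strand survives to be counted are introduced here for illustration, so the result is best read as a lower bound on the sense strand lost.

```python
def sense_strand_shortfall(antisense_ss_pct, sense_ss_pct):
    """Percent of sense strand unwound but not recovered as single strand.

    Unwinding one duplex frees one strand of each kind, so the unwound
    sense strand should equal the unwound anti-sense strand. Assumes the
    measured anti-sense value reflects all unwinding (an assumption).
    """
    if sense_ss_pct > antisense_ss_pct:
        raise ValueError("expected sense <= anti-sense in this comparison")
    return antisense_ss_pct - sense_ss_pct

shortfall = sense_strand_shortfall(22, 3)   # values from the text
ratio = round(22 / 3, 1)                    # anti-sense : sense accumulation
print(shortfall, ratio)
```

On these numbers, roughly 19% of the input sense strand is unaccounted for, and anti-sense single strand accumulates to about seven times the level of sense single strand.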

To establish that the observed asymmetry in the accumulation of the two
single-strands was not an artifact of our unwinding assay, an independent method for
measuring the fraction of siRNA present as single-strands in protein-RNA complexes
was used. In this assay, double-stranded siRNA was incubated with Drosophila embryo
lysate in a standard RNAi reaction for 1 h, then a 31 nt 2'-O-methyl RNA
oligonucleotide containing a 21 nt sequence complementary to the radiolabeled siRNA
strand was added. 2'-O-methyl oligonucleotides are not cleaved by the RNAi machinery,
but can bind stably to complementary siRNA within the RISC (Martin Simard, GH, Craig
Mello, and PDZ, manuscript in preparation). To allow recovery of RISC, the
2'-O-methyl oligonucleotide was tethered to a magnetic bead via a biotin-streptavidin
linkage. After washing away unbound RNA and protein, the amount of radioactive siRNA
bound to the bead was measured. The assay was performed with separate siRNA duplexes
in which either the sense or the anti-sense strand was 5'-32P-radiolabeled. Capture of
32P-siRNA was observed when the 2'-O-methyl oligonucleotide contained a 21-nt region
complementary to the radiolabeled siRNA strand, but not when an unrelated
oligonucleotide was used. The assay captures all RISC activity directed by the siRNA
strand complementary to the tethered oligonucleotide, demonstrating that it measures
siRNA present in the lysate as single-strand complexed with RISC proteins. This assay
recapitulates the results of the unwinding assay described above: for the siRNA in
Figure 1D; 'siRNA B' open bars, nearly ten-fold more anti-sense siRNA was detected
than sense strand. An explanation for these results is that the two strands of this
siRNA duplex are differentially loaded into the RISC, and that single-stranded siRNA
not assembled into RISC is degraded. Functional asymmetry occurred only when the
trigger siRNA was double-stranded, not when the two siRNA strands were tested
individually (Fig. 1B and 1C). Thus, asymmetric assembly of RISC was a feature of the
siRNA duplex, rather than of either the sequences of the individual siRNA strands or
the accessibility of the targeted sites to cleavage.

Example III: Base-pairing at the 5' end of the siRNA strand gates RISC assembly 

The finding that the two siRNA strands can have different capacities to form
RISC when paired in a duplex indicates that some feature of the 19 base-pairs of the
duplex determines functional asymmetry. These base-pairs must be disrupted to produce
RISC (Nykanen et al., 2001), which contains single-stranded siRNA (Martinez et al.,
2002). The siRNAs used in Figure 1B were examined for base-pairing features that
might distinguish the two siRNA strands. For the siRNA in Figure 1B, the 5' end of the
anti-sense siRNA strand begins with U and is thus paired to the sense siRNA strand by
an A:U base pair (2 hydrogen bonds). In contrast, the 5' nucleotide of the sense siRNA
strand is linked to the anti-sense strand by a C:G base pair (3 hydrogen bonds). The
sense siRNA strand forms 8-10-fold less RISC and guides cleavage of its RNA target at
a correspondingly slower rate than the anti-sense strand. A working hypothesis to
explain the observed functional asymmetry is that the siRNA strand whose 5' end is
more weakly bound to the complementary strand more readily incorporates into RISC.
In this view, the relative base-pairing strengths of the 5' ends of the two siRNA
strands would determine their relative extents of RISC formation.
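
The working hypothesis can be illustrated with a minimal Python sketch. It is not part
of the described methods. It assumes a 21 nt siRNA with a 19 base-pair duplex and 2 nt
3' overhangs, so that position 1 of each strand faces position 19 of the other. The
rank values, the function names and the toy sequence are hypothetical; the ranks are
ordinal placeholders (G:C above A:U above a G:U wobble above an unpaired end), not
measured energies.

```python
# Illustrative ordinal ranks for the terminal base pair (assumed, not measured).
PAIR_RANK = {
    frozenset("GC"): 3,  # three hydrogen bonds
    frozenset("AU"): 2,  # two hydrogen bonds
    frozenset("GU"): 1,  # wobble pair, ranked weakest of the pairs (assumption)
}

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the Watson-Crick reverse complement of an RNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def pair_rank(base_a, base_b):
    """Rank a candidate pair; anything not listed counts as unpaired (0)."""
    return PAIR_RANK.get(frozenset((base_a, base_b)), 0)


def favored_strand(sense, antisense, duplex_len=19):
    """Name the strand whose 5' end sits in the weaker terminal pair."""
    sense_rank = pair_rank(sense[0], antisense[duplex_len - 1])
    anti_rank = pair_rank(antisense[0], sense[duplex_len - 1])
    if sense_rank < anti_rank:
        return "sense"
    if anti_rank < sense_rank:
        return "antisense"
    return "neither"


# Hypothetical toy duplex: sense 5' end in a C:G pair, anti-sense 5' end in A:U.
core = "CGAUGCUAGCUAGGAUCCA"
sense = core + "UU"
antisense = reverse_complement(core) + "UU"
print(favored_strand(sense, antisense))         # antisense

# Changing the sense 5' nucleotide from C to U creates a terminal U:G wobble.
mutant_sense = "U" + sense[1:]
print(favored_strand(mutant_sense, antisense))  # sense
```

The sketch looks only at the terminal pair; it ignores the neighboring base pairs,
which the text below reports as also contributing.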



As an initial test of this idea, the 5' nucleotide of the siRNA sense strand was
changed from C to U (Figure 1E). This changed the base pair formed between the
5'-most nucleotide of the sense strand and position 19 of the anti-sense strand from
the Watson-Crick base pair C:G to the weaker, less stable wobble pair U:G, while
leaving the anti-sense strand of the siRNA unaltered. Remarkably, the change of this
single nucleotide not only enhanced the rate of cleavage directed by the sense strand,
but virtually eliminated the ability of the anti-sense strand to direct RNAi (Figure 1E).

To determine the basis for the reversed functional asymmetry for the siRNA in
Figure 1E, the amount of each strand that was single stranded after incubation of the
siRNA duplex in Drosophila embryo lysate was determined. After 1 h, nearly 30% of the
sense siRNA strand was converted to single stranded, but no single-stranded anti-sense
strand was detected (Figure 1D; 'siRNA E'). Therefore, the simplest explanation for the
asymmetric function of this siRNA is that the sense strand, but not the anti-sense, of
this siRNA duplex was incorporated into RISC. Thus, a single nucleotide mutation in the
sense siRNA strand of the siRNA in Figure 1B completely reversed the relative abilities
of the two strands to assemble in the enzyme complex that directs RNAi.

The stability of the initial five base pairs of the siRNA strands was calculated in
Figure 1 using the nearest-neighbor method and the mfold algorithm (D.H. Mathews,
1999; Zuker, 2003). The 5' end of the sense siRNA strand in Figure 1E, but not that in
1B, is predicted to exist as an equilibrium of two conformers of nearly equal energy
(Figure 10). In one conformer, the 5' nucleotide of the sense strand is bound to the
anti-sense strand by a U:G wobble pair, whereas in the other conformer the 5' end of
this siRNA strand is unpaired. The analysis suggests that RISC assembly favors the
siRNA strand whose 5' end has a greater propensity to fray.

To test this hypothesis further, the strand-specific rates of cleavage of sense and
anti-sense human Cu, Zn superoxide dismutase-1 (sod1) RNA targets (Figure 3A)
triggered by the siRNA duplex shown in Figure 3B were examined. Given that the 5'
ends of both siRNA strands of this duplex are in G:C base pairs, it was anticipated that
this duplex would not display pronounced target cleavage asymmetry. As shown in
Figure 3B, the two strands are similar in their rates of target cleavage, although the
rate of anti-sense target cleavage directed by the sense-strand is clearly faster than
the rate of sense-target cleavage guided by the anti-sense strand. This small difference
in rate is likely explained by the sense-strand forming 20 base pairs with its target
RNA, whereas the anti-sense strand can form only 19, consistent with previous reports
that the penultimate position of an siRNA makes a small contribution to its efficacy
(Elbashir et al., 2001b). Next, the C at position 19 of the sense strand was changed to
A, causing the anti-sense strand to begin with an unpaired nucleotide. This change,
which was made to the sense-strand of the siRNA, caused the rate of target cleavage
guided by the anti-sense siRNA strand to be dramatically enhanced and the sense strand
rate to be suppressed (Figure 3C). Because the enhancement of sense target cleavage
was caused by a mutation in the sense siRNA strand, which does not participate in the
recognition of the sense target, the effect of the mutation must be on a step in the
RNAi pathway that is spatially or temporally coupled to siRNA unwinding. However, the
suppression of anti-sense target cleavage clearly might have resulted from the
single-nucleotide mismatch between the sense strand and its target RNA generated by
the C-to-A substitution.

To test if the suppression of the rate of anti-sense target cleavage was a
consequence of the position 19 mismatch, a different strategy was used to unpair the 5'
end of the anti-sense strand. Figure 3D shows an siRNA in which the sense-strand is
identical to that in Figure 3B, but the first nucleotide of the anti-sense strand has been
changed from G to U, creating a U-C mismatch at its 5' end, in place of the G-A of
Figure 3C. Nonetheless, this siRNA duplex showed pronounced asymmetry, with the
anti-sense strand guiding target cleavage to the nearly complete exclusion of the sense
strand (Figure 3D). Thus, the suppression of the cleavage rate of the sense-strand in
Figure 3C was not a consequence of the position 19 mismatch. This finding is
consistent with previous studies that suggest that mismatches with the target RNA are
well tolerated if they occur near the 3' end of the siRNA guide strand (Amarzguioui et
al., 2003). The finding that the siRNAs in Figures 3C and 3D display profound
asymmetry demonstrates that both the enhancement of the target cleavage rate of the
anti-sense strand and the suppression of the function of the sense strand are a
consequence of their relative abilities to enter the RNAi pathway, not their intrinsic
capacity to direct target cleavage.

Finally, the sense strand of Figure 3C was paired with the anti-sense strand of
Figure 3D to create the siRNA duplex shown in Figure 3E. The sense strand of this
siRNA, like that in Figure 3C, contains a mismatch with the anti-sense target at position
19. Like the anti-sense siRNA strand in Figure 3D, the anti-sense strand contains a
mismatch with the sense target at position 1. This siRNA duplex directs anti-sense
target cleavage significantly better than the siRNA in Figure 3C, despite the fact that
the two siRNAs contain the same sense strand (Figure 3E).

Figures 3F, G, and H show a similar analysis in which the 5' end of the sense strand or
position 19 of the anti-sense strand of the siRNA in Figure 3B was altered to produce
siRNA duplexes in which the 5' end of the sense strand was either fully unpaired
(Figures 3F and G) or paired in an A:U base pair (Figure 3H). Again, unpairing the 5'
end of an siRNA strand (the sense strand, in this case) caused that strand to function to
the exclusion of the other strand. When the sense strand 5' end was present in an A:U
base pair and the anti-sense strand 5' end was in a G:C pair, the sense strand dominated
the reaction (Figure 3H), although now the anti-sense strand showed activity similar to
that seen for the original siRNA (Figure 3B) in which both strands were in G:C pairs at
their 5' ends. Converting the unpaired 5' end of the siRNAs in Figure 3 to an A:U pair
reduced the functional asymmetry of the two strands by enhancing the efficacy of the
sense strand (Figure 3E) or the anti-sense strand (Figure 3H). The relative ease with
which the 5' ends of the two siRNA strands can be liberated from the duplex determines
the degree of asymmetry. Additional data supporting this idea are shown in Figure 8,
using a different siRNA. Figure 8B shows an siRNA that cleaved the two sod1 target
RNAs (Figure 8A) with modest functional asymmetry that reflects the collective base
pairing strength of the first four or five nucleotides of each siRNA strand (Figure 8E;
see below). Asymmetry was dramatically increased when a G:U wobble was introduced at
the 5' end of the anti-sense strand of the siRNA (Figure 8C), but no asymmetry was seen
when the individual single strands were used to trigger RNAi (Figure 8D),
demonstrating that differential RISC assembly, not target accessibility, explains the
functional asymmetry of the siRNA duplex.

Together, the data in Figures 1, 2, and 8 indicate that the symmetry of RISC
assembly is determined by a competition between the fraying of the 5' ends of the two
siRNA strands in the duplex. Such fraying may initiate a directional process of
unwinding in which the strand at which unwinding is initiated preferentially enters
RISC. Such a model requires either that RISC assembly factors or RISC components
themselves are loaded onto one of the two siRNA strands before unwinding is completed,
or that information about the siRNA strand's prior state of pairing is retained, perhaps
by a protein such as the helicase remaining bound to a strand.

Example IV: A single hydrogen bond can determine which strand of an siRNA
duplex directs RNAi

To explore this hypothesis further, additional changes were made to the
sod1-specific siRNA in Figure 3. These modifications alter the function of the two
strands of the siRNA, but do not change the site cleaved on the two target RNAs. In
Figure 4A, the anti-sense strand of Figure 3B was paired with a sense strand identical
to that in Figure 3B except the 5' G was replaced with inosine (I). Like G, I pairs with
C, but makes two instead of three hydrogen bonds. In this respect, an I:C pair is similar
in energy to an A:U pair. The resulting siRNA was functionally asymmetric: when the
sense-strand began with an I, it directed target cleavage more efficiently than the
antisense-strand (Figure 4A). The asymmetry reflects an enhancement in efficacy of the
sense siRNA strand, with little loss in the function of the anti-sense strand. An inosine
at the 5' end of the anti-sense strand had the opposite effect. When the G at position 1
of the anti-sense strand was substituted with inosine and the sense strand was that of
Figure 3B, the anti-sense strand was enhanced relative to the sense strand (Figure 4B).
Thus, the strand whose 5' end is in the weaker base pair was more effective at target
cleavage.

Remarkably, when the 5' nucleotides of both siRNA strands engage in I:C base pairs
(Figure 4C), the relative efficacy of the two siRNA strands is restored to that reported
in Figure 3B. The slightly faster rate for anti-sense target cleavage than for sense
target cleavage is also seen for RNAi triggered with the individual, inosine-containing
single strands, indicating that it reflects a difference in the intrinsic capacity of the
two strands to guide cleavage, rather than a difference in RISC assembly. Although the
relative rates of cleavage of the two strands are comparable for the siRNAs in Figure
3B and 4C, the absolute rates are faster for the siRNA in Figure 4C. These data indicate
that production of RISC from an individual strand is governed both by the relative
propensity of the siRNA 5' end to fray compared to that of its complementary strand and
by the absolute propensity of the siRNA 5' end to fray. This latter finding is
particularly unexpected, in that it shows that a difference of a single hydrogen bond has
a marked effect on the rate of RISC assembly. siRNA end fraying provides an entry site
for an ATP-dependent RNA helicase that unwinds siRNA duplexes (Figure 4). The helicase
makes many abortive attempts to dissociate the two siRNA strands before succeeding in
loading one strand into RISC. The involvement of a helicase in RISC assembly is
supported by previous observations: (1) both siRNA unwinding and production of
functional RISC require ATP in vitro (Nykanen et al., 2001) and (2) several proteins
with sequence homology to ATP-dependent RNA helicases have been implicated in RNA
silencing (Wu-Scharf et al., 2000; Dalmay et al., 2001; Hutvagner and Zamore, 2002;
Ishizuka et al., 2002; Kennerdell et al., 2002; Tabara et al., 2002; Tijsterman et al.,
2002).

The effect of single-nucleotide mismatches in this region of the siRNA was further
tested using a series of siRNAs containing a mismatch at the second, third, or fourth
position of each siRNA strand. siRNAs bearing G:U wobble pairs at the second, third, or
both second and third positions (Figure 11) were also analyzed. The results of this
series demonstrate that mismatches, but not G:U wobbles, at positions 2-4 of an siRNA
strand alter the relative loading of the two siRNA strands into RISC. Mismatches at
position five have very modest effects on the relative loading of the siRNA strands into
RISC (data not shown). In contrast, the effects of internal mismatches at positions 6-15
cannot be explained by their influencing the symmetry of RISC assembly (data not
shown). In sum, these data are consistent with the action of a non-processive helicase
that can bind about four nucleotides of RNA.
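
The pattern reported in this Example (mismatches at positions 1-4 and a G:U wobble at
position 1 shift strand loading, whereas internal G:U wobbles do not) can be restated as
a simple counting rule. The Python sketch below is only an illustration of that rule
under stated assumptions: a 19 base-pair duplex with 2 nt 3' overhangs, a four-nucleotide
window, and a hypothetical toy sequence. The function name and the idea of scoring a
strand by a raw count are not taken from the experiments.

```python
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def fraying_features(strand, partner, duplex_len=19, window=4):
    """Count destabilizing features in the first `window` positions of `strand`.

    Position i (0-based) of `strand` faces position duplex_len - 1 - i of
    `partner`; both strands are written 5' to 3'.
    """
    count = 0
    for i in range(window):
        pair = (strand[i], partner[duplex_len - 1 - i])
        if pair in WATSON_CRICK:
            continue
        if pair in WOBBLE and i > 0:
            continue  # internal wobble: reported not to shift loading
        count += 1  # mismatch in the window, or a wobble at position 1
    return count


# Hypothetical, fully paired toy duplex (19 bp plus UU overhangs).
sense = "CGAUGCUAGCUAGGAUCCAUU"
antisense = "UGGAUCCUAGCUAGCAUCGUU"

# A:C mismatch opposite sense position 3, made by editing the anti-sense strand.
mismatch_partner = antisense[:16] + "C" + antisense[17:]
# G:U wobble opposite sense position 2, made by editing the anti-sense strand.
wobble_partner = antisense[:17] + "U" + antisense[18:]

print(fraying_features(sense, antisense))         # 0
print(fraying_features(sense, mismatch_partner))  # 1
print(fraying_features(sense, wobble_partner))    # 0
```

Under this rule the strand with the higher count would be the one expected to load
preferentially; position five and beyond are deliberately left out of the window.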

Example V: Implications of siRNA asymmetry in miRNA biogenesis 

One implication of the findings presented herein is that although siRNAs are
predominantly present as duplexes at steady state in vitro (Nykanen et al., 2001) and
perhaps in vivo (Hamilton and Baulcombe, 1999; Djikeng et al., 2001), both strands of
an siRNA are unlikely to be present equally in RISC. That is, the strength of the base
pairs at the 5' ends of the two siRNA strands can influence their accumulation as
single-strands. When the 5' end of one strand is unpaired, this asymmetry can be nearly
absolute. This observation suggested that asymmetric incorporation into RISC, as a
consequence of directional unwinding from a frayed end of an siRNA duplex, might also
explain why miRNAs accumulate as single strands. Animal miRNAs are derived from
the double-stranded stem of ~70 nt stem-loop precursor RNAs (Lee et al., 1993;
Pasquinelli et al., 2000; Reinhart et al., 2000; Lagos-Quintana et al., 2001; Lau et al.,
2001; Lee and Ambros, 2001; Lagos-Quintana et al., 2002). pre-miRNA stems are only
partially double-stranded; the typical pre-miRNA contains mismatches, internal loops,
and G:U base pairs predicted to distort an A-form RNA helix. miRNAs are generated
from pre-miRNAs by the double-stranded RNA-specific endonuclease Dicer (Hutvagner
et al., 2001; Grishok et al., 2001; Ketting et al., 2001). It was previously proposed by
the instant inventors that miRNAs are single-stranded because helical discontinuities
constrain Dicer to break only two, rather than four, phosphodiester bonds, yielding a
single-stranded miRNA, rather than an siRNA-like duplex (Hutvagner et al., 2001).
Such a mechanism has precedent, because E. coli RNase III can be constrained by
helical distortions to make only one or two breaks in an RNA chain (Chelladurai et al.,
1993).

An alternative hypothesis is that Dicer cleaves four phosphodiester bonds in
all of its substrates, both long dsRNA and pre-miRNAs, and always generates a product
with the essential structure of an siRNA duplex (Hutvagner and Zamore, 2002; Reinhart
et al., 2002; Lim et al., 2003b). This mechanism for miRNA production was originally
suggested by Bartel and colleagues. Using a small RNA cloning strategy to identify
mature miRNAs in C. elegans, they recovered small RNAs corresponding to the
non-miRNA side of the precursor's stem (Lim et al., 2003b). Although these 'miRNA*'
sequences were recovered at about 100 times lower frequency than the miRNAs
themselves, they could always be paired with the corresponding miRNA to give 'miRNA
duplexes' with 2 nt overhanging 3' ends (Lim et al., 2003b). Their data suggest that
miRNAs are born as duplexes, but accumulate as single-strands because some subsequent
process stabilizes the miRNA, destabilizes the miRNA*, or both.

The incorporation of miRNA into RISC is this process. Our results with siRNA
suggest that preferential assembly of a miRNA into the RISC would be accompanied by
destruction of the miRNA*. If the rate of asymmetric RISC assembly was faster than the
production of the miRNA duplexes, only single-stranded miRNAs would be observed at
steady-state (Figure 4). The accumulation of single-strands and not duplexes for
miRNAs would simply be a consequence of Dicer being significantly less efficient in
cleaving pre-miRNAs compared to long dsRNA (Hutvagner et al., 2001). The rate of
asymmetric RISC assembly might be faster than the production of miRNA duplexes, so
only single-stranded miRNAs would be observed at steady-state. Two key predictions of
this hypothesis are that (1) purified Dicer should cleave pre-miRNAs into equal amounts
of miRNA and miRNA* products and (2) pre-miRNA structures should be processed by
Dicer into duplexes with the 5' end of the miRNA strand frayed or weakly hydrogen
bonded and the 5' end of the miRNA* strand more securely base paired.

A. Dicer cleaves pre-let-7 symmetrically

To begin to test the idea that pre-miRNAs are cleaved by Dicer to generate a
product with the essential structure of an siRNA duplex, we incubated the Drosophila
pre-miRNA, pre-let-7, with purified, recombinant Dicer and analyzed the products by
Northern hybridization using probes specific for either the 5' side of the precursor stem
that encodes mature let-7 or for products derived from the 3' side of the precursor stem
(let-7* products). As a control, the let-7 precursor RNA was incubated in Drosophila
embryo lysate, which recapitulates both pre-let-7 maturation and RNAi in vitro. As
previously reported, incubation of pre-let-7 RNA in the lysate produced a single band
corresponding to authentic let-7, but no let-7* products (Hutvagner et al., 2001; Figure
5A and 5B). In contrast, incubation of pre-let-7 with Dicer yielded approximately equal
amounts of let-7 and let-7* products. At least three distinct RNAs were generated from
each side of the stem, rather than the single band corresponding to mature let-7 observed
in the embryo lysate. Thus, the absence of let-7* in vivo and in the embryo lysate
reaction cannot be explained by the influence of pre-let-7 structure on Dicer.

B. Asymmetric RISC assembly explains why miRNAs are single-stranded

If Dicer cleaves both sides of the pre-let-7 stem, then some step downstream
from Dicer action selects mature let-7 from an siRNA-like duplex in which let-7 is
paired with let-7*. A good candidate for such a step would be the asymmetric
incorporation of let-7 into RISC, accompanied by the degradation of let-7*. To test this
idea, the siRNA that might be formed if pre-let-7 were cleaved by Dicer into an siRNA
duplex-like structure was deduced. The sequence of this 'pre-let-7 siRNA,' generated by
'conceptual dicing,' is shown in Figure 6A (see below). Notably, the 5' end of let-7 is
unpaired in this duplex, whereas the 5' end of the let-7* strand is in an A:U base pair.
The results presented in Figures 2, 3, and 4 suggest that this structure should cause the
let-7 strand to enter the RISC to the near exclusion of the let-7* strand, which would
consequently be degraded.



C. miRNA versus miRNA* selection in Drosophila

This analysis was next extended to the other published Drosophila miRNA genes
(Lagos-Quintana et al., 2001). For each precursor structure, the double-strand predicted
to be produced by Dicer was deduced. These conceptually diced duplexes are shown in
Figure 6A. For 23 of the 27 duplexes generated by this analysis (including pre-let-7),
the difference in the base pairing of the first five nucleotides of the miRNA versus
miRNA* strands accurately predicted that the miRNA, and not the miRNA*, accumulates
in vivo. The analysis succeeded irrespective of which side of the pre-miRNA stem
encoded the mature miRNA. This analysis is consistent with previous observations that
single mismatches in the first four nucleotides of an siRNA strand, or an initial G:U
wobble pair, but not internal G:U wobbles, directed the asymmetric incorporation of an
siRNA strand into RISC (Figures 1, 2, 3, 8, 9, and 11). However, no difference was
discerned in the propensity to fray of the 5' ends of the miRNA and * strands for miR-4,
miR-5, the three miR-6-2 paralogs, and miR-10. Therefore, it could not be explained
why a particular strand would accumulate as the mature miRNA for these three miRNA
precursors. miR-5 and miR-10, like other Drosophila miRNAs, were identified by the
cloning and sequencing of small RNAs from embryos (Lagos-Quintana et al., 2001).
Determinants other than end fraying appear to function in the selection of miR-4 and
miR-6; these unknown determinants may also play a role in the assembly of an siRNA
strand into RISC. However, miR-5 and miR-10 were cloned only once, raising the
possibility that miR-5* or miR-10* is present in embryos, but not represented among the
library of small RNAs from which the miRNAs were cloned. Similarly, miR-6 is encoded
by three paralogous genes, only one of which we predict to produce detectable amounts
of the miR*, so this * strand might have also gone undetected. To test if both the miRNA
and * strands might accumulate for some or all of these three genes, Northern
hybridization was used to examine the relative abundance of miR-10 and miR-10* in
adult Drosophila males and females, and in syncytial blastoderm embryos. The results
detected both miR-10* and miR-10 in vivo (Figure 6C). In fact, the results indicated that
more miR-10* was detected than miR-10 in adult males. This finding strengthens the
proposal that miRNA genes (i.e., pre-miRNAs) uniquely specify on which side of the
stem the miRNA resides by generating siRNA-like duplexes from which only one of the
two strands of the duplex is assembled into RISC. When these double-stranded
intermediates do not contain structural features enforcing asymmetric RISC assembly,
both strands accumulate in vivo. It is possible that pre-miRNAs such as pre-miR-10,
which generates roughly equal amounts of small RNA products from both sides of the
precursor stem, simultaneously regulate target RNAs with partial complementarity to
both small RNA products.

Example VI: Increased rate of siRNA efficiency through the use of dTdT tails

Art-recognized protocols for designing siRNA duplexes teach the inclusion of
dTdT tails (i.e., 2-nucleotide overhangs consisting of dTs). Two duplexes were created
to test whether the addition of 3' overhanging dTdT tails increases the rate of siRNA
targeting efficiency of the Cu, Zn superoxide-dismutase-1 (Sod1) mRNA. The first
duplex contained sense and antisense strands, each including 21 nucleotides with 19
complementary bases plus 2-nucleotide overhangs (the overhangs consisting of bases in
common with the target sequence). The second duplex contained sense and antisense
strands, each including 19 complementary nucleotides (in common with the Sod1
target), plus 2-nucleotide dTdT tails at the 3' end of the strand (not matching the Sod1
target). Results demonstrate that the rate of siRNA efficiency improved ~8 fold when
using the duplex having mismatched dTdT tails (Figure 12).

Discussion of Examples I-VI: Implications for RNA silencing

The observations described herein provide rules for siRNA design. Clearly,
siRNA structure can profoundly influence the entry of the anti-sense siRNA strand into
the RNAi pathway. Thus, the sequence of the siRNA, rather than that of the target site,
may explain at least some previous reports of ineffective siRNA duplexes. Such
inactive duplexes may be coaxed back to life by modifying the sense strand of the
siRNA to reduce the strength of the base pair at the 5' end of the anti-sense strand. An
example of this in vitro is shown in Figure 9, for an ineffective siRNA directed against
the huntingtin (htt) mRNA (Figure 9A). Changing the G:C (Figure 9B) to an A:U pair
(Figure 9C) or a G-A mismatch (Figure 9D) dramatically improved its target cleavage
rate in vitro and its efficacy in vivo (Eftim Milkani, NA, and PDZ, unpublished
observations). In fact, Khvorova and colleagues have found that a low base-pairing
stability at the 5' end of the antisense strand, but not the sense strand, is a prerequisite
for siRNA function in cultured mammalian cells (Anastasia Khvorova, Angela
Reynolds, and Sumedha D. Jayasena, manuscript submitted).

siRNAs designed to function asymmetrically may also be used to enhance
RNAi specificity. Recently, expression profiling studies have shown that the
sense-strand of an siRNA can direct off-target gene silencing (A.L. Jackson, et al. (2003)
Nature Biotechnology, May 18). The data presented herein provide a strategy for
eliminating such sequence-specific but undesirable effects: redesigning the siRNA so
that only the anti-sense strand enters the RNAi pathway.

The observations described herein provide new design rules for the construction
of short hairpin RNAs (shRNAs), which produce siRNAs transcriptionally in cultured
cells or in vivo (Brummelkamp et al., 2002; McManus et al., 2002; Paddison et al., 2002;
Paul et al., 2002; Sui et al., 2002; Yu et al., 2002). shRNA strategies typically employ a
Pol III promoter to drive transcription, so the shRNA must begin with several G
residues. As a consequence, the 5' end of the siRNA may be sequestered in a G:C base
pair, significantly reducing entry of the anti-sense strand into the RNAi pathway. To
avoid this problem, the anti-sense strand of the desired siRNA can be placed on the 3'
side of the loop, so as to ensure that its 5' end is in an A:U, rather than the G:C pair
typically encoded. Alternatively, the hairpin can be designed to place the 5' end of the
anti-sense siRNA strand in a mismatch or G:U base pair, in which case it can be placed
on either side of the stem. Moreover, a recent report suggests that some shRNAs
may induce the interferon response (Bridge et al., 2003). The data suggest that
mismatches and G:U pairs could be designed into these shRNAs simultaneously to
promote entry of the correct siRNA strand into the RNAi pathway and to diminish the
capacity of the shRNA stem to trigger non-sequence specific responses to
double-stranded RNA.

Finally, the data identify an unanticipated step in the RNAi pathway: the direct
coupling of siRNA unwinding to RISC assembly. This finding suggests that the helicase
responsible for unwinding siRNA duplexes will be intimately linked to other
components of the RNAi machinery. Identifying the helicase and the proteins with
which it functions to assemble the RISC is clearly an important challenge for the future.

Example VII: The siRNA-programmed RISC is an enzyme

RISC programmed with small RNA in vivo catalyzes the destruction of target
RNA in vitro without consuming its small RNA guide (Tang et al., 2003; Hutvagner et
al., 2002). To begin a kinetic analysis of RISC, it was first confirmed that RISC
programmed in vitro with siRNA is likewise a multiple-turnover enzyme. To engineer an
RNAi reaction that contained a high substrate concentration relative to RISC, an siRNA
was used in which the guide strand is identical to the let-7 miRNA, but unlike the
miRNA, the let-7 siRNA is paired to an RNA strand anti-sense to let-7 (Hutvagner et al.,
2002). The let-7 strand of this siRNA has a high intrinsic cleaving activity, but a reduced
efficiency of incorporation into RISC (Figure 19A).

After incubating the let-7 siRNA with Drosophila embryo lysate in the presence
of ATP, RISC assembly was inactivated by treatment with N-ethyl maleimide (NEM),
and the amount of RISC generated was measured using the previously described
tethered 2'-O-methyl oligonucleotide assay (Hutvagner et al., 2004; Schwartz et al.,
2003) (Figure 19B, C). The amount of let-7 programmed RISC increased with
increasing siRNA concentration, until the assembly reaction began to saturate at ~50
nM, reaching an asymptote between 3 and 4 nM RISC. Using 0.6 nM RISC, >50 cycles
of target recognition and cleavage per enzyme complex (data not shown) were observed,
confirming that siRNA-programmed RISC is a multiple-turnover enzyme.
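
The multiple-turnover criterion amounts to dividing the cleaved product by the enzyme
concentration. The short Python sketch below shows that arithmetic. Only the 0.6 nM
RISC concentration comes from the text; the 30 nM product value and the function names
are hypothetical and chosen purely for illustration.

```python
def turnovers_per_enzyme(product_nM, enzyme_nM):
    """Average number of cleavage cycles completed per enzyme complex."""
    if enzyme_nM <= 0:
        raise ValueError("enzyme concentration must be positive")
    return product_nM / enzyme_nM


def is_multiple_turnover(product_nM, enzyme_nM):
    """True when each complex has, on average, cleaved more than one target."""
    return turnovers_per_enzyme(product_nM, enzyme_nM) > 1.0


# 0.6 nM RISC is taken from the text; 30 nM cleaved product is a made-up value.
cycles = turnovers_per_enzyme(product_nM=30.0, enzyme_nM=0.6)
print(round(cycles, 1), is_multiple_turnover(30.0, 0.6))
```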

Example VIII: Multiple-turnover is limited by product release 

The kinetics of siRNA-directed target cleavage in the presence or absence of
ATP were further evaluated (Figure 13). RISC was assembled in the presence of ATP,
then the energy regenerating enzyme, creatine kinase, was inactivated with NEM, and
ATP depleted by adding hexokinase and glucose (-ATP conditions). For +ATP
measurements creatine kinase was added to the reaction after NEM-treatment, and the
hexokinase treatment was omitted. A faster rate of cleavage in the presence than in the
absence of ATP was observed. This difference was only apparent late in the reaction
time course, indicating that the ATP-dependent rate of cleavage was faster than the
ATP-independent rate only at steady state (Figure 13A). The analysis was repeated in
more detail (Figure 13B). In the absence of ATP, a burst of cleaved product early in
the reaction, followed by a ~4-fold slower rate of target cleavage was observed. No
burst was observed in the presence of ATP (Figure 13A). If the burst corresponds to a
single-turnover of enzyme, then extrapolation of the slower steady state rate back to the
y-axis should give the amount of active enzyme in the reaction. The y-intercept at the
start of the reaction for the steady-state rate was 4.9 nM, in good agreement with the
amount of RISC estimated using the tethered 2'-O-methyl oligonucleotide assay nM;
Figure 13B).
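
The extrapolation described above can be sketched in a few lines of Python: fit a
straight line to the steady-state portion of a product-versus-time curve and read the
y-intercept as the burst amplitude. This is a minimal illustration, not the analysis
actually used. The time course below is synthetic, and the function names and the
cut-off for where steady state begins are assumptions.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def burst_amplitude(times_s, product_nM, steady_state_start_s):
    """Fit only the points at or after steady_state_start_s.

    Returns (steady-state rate in nM/s, y-intercept in nM). Under burst
    kinetics the intercept estimates the amount of active enzyme.
    """
    late = [(t, p) for t, p in zip(times_s, product_nM) if t >= steady_state_start_s]
    if len(late) < 2:
        raise ValueError("need at least two steady-state points")
    ts, ps = zip(*late)
    return fit_line(ts, ps)


# Synthetic data: a fast burst, then a linear phase p = 5.0 + 0.01 * t.
times = [0, 15, 30, 60, 120, 240, 480, 960]
product = [0.0, 2.5, 4.0, 5.2, 6.2, 7.4, 9.8, 14.6]
rate, intercept = burst_amplitude(times, product, steady_state_start_s=120)
print(round(rate, 3), round(intercept, 2))
```

The choice of the steady-state cut-off matters: including burst-phase points in the fit
would pull the intercept down and underestimate the active enzyme.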

In principle, ATP could enhance target recognition by RISC, promote a
rearrangement of the RISC/target complex to an active form, facilitate cleavage itself,
promote the release of the cleavage products from the siRNA guide strand, or help
restore RISC to a catalytically competent state after product release. All of these steps,
except product release and restoration to catalytic competence, should affect the rate of
both multiple and single-turnover reactions. Therefore, the rate of reaction in the
presence and in the absence of ATP under conditions in which RISC was in excess over
the RNA target was analyzed. At early times under these conditions, the reaction rate
should reflect only single-turnover cleavage events, in which events after cleavage do
not determine the rate of reaction. Using single-turnover reaction conditions, identical
rates of RISC-mediated cleavage in the presence or absence of ATP were observed
(Figure 13C). Thus, ATP must enhance a step that occurs only when each RISC
catalyzes multiple cycles of target cleavage.

If product release is rate-determining for multiple-turnover catalysis by RISC in
the absence, but not the presence, of ATP, then modifications that weaken the strength
of pairing to the target RNA might enhance product release, but would not be expected
to accelerate the return of RISC to a catalytically competent state. Mismatches
between the siRNA and its RNA target were incorporated at the 3' end of the siRNA guide
strand, and the siRNAs were designed to be functionally asymmetric, ensuring efficient
and predictable incorporation of the let-7 strand into RISC (Figure 14 A). The reaction
velocity under conditions of substrate excess in the presence and in the absence of ATP
was compared for siRNAs with zero to four mismatches between the guide strand 3' end
and the RNA target. Cleavage was measured between 100 and 540 s, when >90% of the
target remained uncleaved, ensuring that the multiple-turnover reaction was at steady
state. Even a single 3' mismatch between the siRNA and its target increased the -ATP
rate, relative to the +ATP rate, and siRNAs with two or more mismatches showed no
significant difference in rate between the presence and absence of ATP (Figure 14 B).
The results indicated that in the absence of ATP, product release is the rate-determining
step for siRNAs fully matched to their RNA targets.

Example IX: siRNA:target complementarity and RISC function 

Mismatches between the siRNA and its target facilitate product release, but not
without cost: the rate of reaction, irrespective of ATP concentration, decreases with each
additional 3' mismatch. When the concentration of RISC was ~16- to 80-fold greater than
the target RNA concentration, each additional mismatch between the 3' end of the
siRNA guide strand and the RNA target further slowed the reaction (Figure 14 C,D).
Under conditions of substrate excess, the effect of mismatches between the 3' end of the
siRNA guide strand and its RNA target was more striking (Figure 15 A): the rate of
cleavage slowed ~20% for each additional mismatch. To test the limits of the tolerance
of RISC for 3' mismatches, cleavage was analyzed under modest (8-fold, Figure 15 B) and
vast (~80-fold, Figure 15 C and 16 C) enzyme excess over target RNA. Remarkably,
cleavage was detected for siRNAs with as many as nine 3' mismatches to the RNA
target (Figure 15 C and 16 C), but only after 24 hour incubation. No cleavage was
detected for an siRNA with ten 3' mismatches to the RNA target (Figure 15 C).

Linsley and colleagues have proposed siRNA-directed down-regulation of an
mRNA with as few as eleven contiguous bases complementary to the siRNA guide
strand (Jackson et al., 2003). In that study, the mRNA target paired with both nts 2-5
and nts 7-17 of the siRNA guide strand, but mismatched at nts 1 and 6 of the siRNA.
Results indicated that up to five mismatched bases are tolerated between the 5' end of
the siRNA and its RNA target (Figure 16 A,B). No cleavage was detected for siRNAs
with six, seven, or eight 5' mismatches to the target, even after 24 hour incubation. The
siRNA bearing eight mismatches between its 5' end and the let-7 complementary target
was fully active when eight compensatory mutations were introduced into the let-7
binding site (Figure 15 C and 16 B), demonstrating that mutation of the siRNA was not
the cause of its inactivity against the mismatched target. Similarly, when eight
mismatches with the 3' or 5' end of the siRNA were created by changing the sequence
of the RNA target, target RNA cleavage was detected when the target contained eight
mismatches with the siRNA 3' end, but not with the 5' end (Figure 16 B,C).

To begin to estimate the minimal number of base pairs between the siRNA and
its target that permit detectable cleavage by RISC at 24 hour incubation, seven, eight, or
nine 3' mismatches were combined with increasing numbers of 5' mismatches (Figure
16 C). Cleavage was detected for as many as nine 3' mismatches. However, no
detectable cleavage occurred when seven, eight, or nine 3' mismatches were combined
with two or more 5' mismatches. In contrast, a single 5' mismatch (p1) enhanced target
cleavage directed by all three 3' mismatched siRNAs. Only 6% of the target RNA was
cleaved after 24 hours when the siRNA contained nine contiguous 3' mismatches with
the target RNA, but 10% was cleaved when the siRNA contained both nine 3'
mismatches and a single (p1) 5' mismatch. Cleavage was similarly enhanced by the
addition of a p1 mismatch to seven 3' mismatches (49% cleavage versus 75% cleavage
at 24 hours) or to eight 3' mismatches (21% versus 42% cleavage at 24 hours). The
finding that unpairing of the first base of the siRNA guide strand potentiated cleavage
under single-turnover conditions indicated that a conformational change occurs in RISC
during which the paired p1 base becomes unpaired prior to cleavage. Intriguingly, p1 is
often predicted to be unpaired for miRNAs bound to their targets (Lewis et al., 2003;
Rhoades et al., 2002; Stark et al., 2003).

For siRNAs that pair fully with their RNA targets, the scissile phosphate always
lies between the target nucleotides that pair with siRNA bases 10 and 11 (Elbashir et al.,
2001; Elbashir et al., 2001). Analysis at single nucleotide resolution of the 5' cleavage
products generated by siRNAs with three, four, or five 5' mismatches (Figure 16 D) or
six 3' terminal mismatches (data not shown) revealed that the scissile phosphate on the
target RNA remained the same, even when five 5' nts of the siRNA guide strand were
mismatched with the target RNA (Figure 16 D). As discussed below, this result indicates
that the identity of the scissile phosphate is a consequence of the structure of RISC,
rather than being measured from the 5' end of the helix formed between the siRNA and
its RNA target.

Example X: Kinetic analysis of RISC catalysis 
The role of nucleotides in the terminal regions of the siRNA guide strand in
directing RISC activity was next studied. Reduced pairing between an siRNA and its
target might disrupt the binding of RISC to its target. Alternatively, mismatches might
disrupt the structure, but not the affinity, of the siRNA/target interaction. Fully matched
siRNAs are thought to form a 21 base-pair, A-form helix with the target RNA (Chiu et
al., 2003; Chiu et al., 2002), but do all parts of this helix contribute equally to target
binding or do some regions provide only a catalytically permissive geometry? To
distinguish between these possibilities, the Michaelis-Menten kinetics of siRNA-directed
target-RNA cleavage were analyzed for a perfectly matched siRNA and for three siRNAs
mismatched at their termini. siRNAs were assembled into RISC, then diluted with
reaction buffer to the desired RISC concentration and mixed with target RNA. For each
siRNA, the initial velocity of reaction was determined at multiple substrate
concentrations (Figure 21 A), and KM and kcat were determined from a non-linear least
squares fit of substrate concentration versus initial velocity (Figure 17 A). By this assay,
the KM of the let-7 siRNA with complete complementarity to its target was estimated to
be ~8.4 nM (Table 1). No significant difference in KM, within error, was detected
between the fully paired siRNA and siRNA variants bearing three to five mismatches at
their 3' end or three mismatches at their 5' end (Figure 17 A and Table 1). For the
mismatched siRNAs, a higher than optimal enzyme concentration was used in order to
detect cleavage. Therefore, the KM measurements for the mismatched siRNAs
represent an upper bound for the actual KM values.
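
A non-linear least-squares fit of the kind described above can be sketched as follows.
This is a minimal illustration using only the Python standard library, not the fitting
software used for Figure 17 A. For each trial KM the best Vmax has a closed form, so the
fit reduces to a one-dimensional golden-section search over log(KM). The function names,
the search bounds, the assumed 1 nM enzyme concentration, and the synthetic data are all
assumptions made for the example.

```python
import math


def mm_velocity(s, vmax, km):
    """Michaelis-Menten initial velocity at substrate concentration s."""
    return vmax * s / (km + s)


def fit_michaelis_menten(substrate, velocity, km_lo=0.01, km_hi=1000.0, iterations=200):
    """Least-squares fit of Vmax and KM; the bounds on KM (nM) are assumed."""

    def best_vmax(km):
        # For a fixed KM the model is linear in Vmax, so Vmax has a closed form.
        shape = [s / (km + s) for s in substrate]
        return sum(v * f for v, f in zip(velocity, shape)) / sum(f * f for f in shape)

    def sse(log_km):
        km = math.exp(log_km)
        vmax = best_vmax(km)
        return sum((v - mm_velocity(s, vmax, km)) ** 2 for s, v in zip(substrate, velocity))

    # Golden-section search for the KM that minimizes the squared error.
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    lo, hi = math.log(km_lo), math.log(km_hi)
    for _ in range(iterations):
        left = hi - ratio * (hi - lo)
        right = lo + ratio * (hi - lo)
        if sse(left) < sse(right):
            hi = right
        else:
            lo = left
    km = math.exp((lo + hi) / 2.0)
    return best_vmax(km), km


# Synthetic, noise-free data generated from KM = 8.4 nM and kcat = 7.1e-3 s^-1
# at an assumed enzyme concentration of 1 nM (illustrative only).
enzyme_nm = 1.0
substrate_nm = [1.0, 2.0, 5.0, 10.0, 20.0, 50.0, 100.0]
velocity_nm_s = [mm_velocity(s, 7.1e-3 * enzyme_nm, 8.4) for s in substrate_nm]

vmax_fit, km_fit = fit_michaelis_menten(substrate_nm, velocity_nm_s)
kcat_fit = vmax_fit / enzyme_nm  # kcat = Vmax / [active enzyme]
print(f"KM = {km_fit:.2f} nM, kcat = {kcat_fit:.2e} s^-1")
```

Because kcat is obtained by dividing Vmax by the enzyme concentration, any error in the
estimate of active RISC carries directly into kcat but not into KM.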

While the KM was unaltered for the let-7 siRNA containing several terminal
mismatches, the turnover number, kcat, was decreased by terminal mismatches (Table
1). Three mismatches at either the 3' or the 5' end of the siRNA halved the kcat. The
introduction of five 3' mismatches also had no significant effect on KM, yet decreased
kcat nearly 17-fold (Table 1).

Table 1 summarizes the kinetic data from the analysis in Figure 17 A. For
comparison, the KM and kcat values of four well studied protein enzymes are provided.
KM and kcat ± error of fit are reported.



Example XI: KM reflects the binding strength of RISC

To estimate the contribution of binding to KM, a competition assay that
measures the ability of 2'-O-methyl oligonucleotides to inhibit target cleavage by RISC
was used (Figure 17 B,C). Such a strategy was used previously to analyze the
mechanism of target destruction by antisense oligonucleotides that recruit RNase H
(Lima et al., 1997). The anticipation was that 2'-O-methyl oligonucleotides would act as
competitive inhibitors of RISC, because they bind to RISC containing complementary
siRNA but not to RISC containing unrelated siRNA (Hutvagner et al., 2004; Meister et
al., 2004). Thirty-one nt long 2'-O-methyl oligonucleotides were designed as described
previously (Hutvagner et al., 2004), taking care to exclude sequences predicted to form
stable internal structures. 2'-O-methyl oligonucleotides were chosen because of their
marked stability in Drosophila lysate and because they can be added to the reaction at
high concentration.

Competition by 2'-O-methyl oligonucleotides and bona fide RNA targets was
quantitatively similar. The reaction velocities of siRNA-directed cleavage of a 32P-
radiolabeled target were analyzed in the presence of increasing concentrations of
unlabeled capped RNA target or a 31-nt 2'-O-methyl oligonucleotide corresponding to the
region of the target containing the siRNA binding site (Figure 17 B). Lineweaver-Burk
analysis of the data confirmed that 2'-O-methyl oligonucleotides act as competitive
inhibitors of RISC (data not shown). These data were used to calculate Ki values for the
perfectly matched RNA and 2'-O-methyl competitors. For the capped RNA competitor,
the Ki was ~7.7 ± 4 nM (Figure 17 B), nearly identical to the KM for this siRNA, 8.4
nM (Table 1). The Ki for the perfectly matched 2'-O-methyl competitor oligonucleotide
was 3.2 ± 1 nM (Figure 17 B), essentially the same, within error, as that of the all-RNA
competitor. The results indicated that 2'-O-methyl oligonucleotides are good models for
5'-capped RNA targets and that the KM for target cleavage by RISC is largely
determined by the affinity (KD) of RISC for its target RNA.

Although targets with more than five contiguous mismatches to either end of the
siRNA are poor substrates for cleavage, they might nonetheless bind RISC and compete
with the 32P-radiolabeled target RNA. The 2'-O-methyl oligonucleotide competition
assay was used to determine the Ki values for oligonucleotides containing as many as
eight mismatches to the siRNA guide strand (Figure 17 B). 2'-O-methyl
oligonucleotides with 3' terminal mismatches to the siRNA were good competitors: a
four nucleotide mismatch with the 3' end of the siRNA increased the Ki by only ~3-fold
(9.0 ± 0.9 nM) and an eight nucleotide mismatch with the 3' end of the siRNA increased
the Ki by ~10-fold (34.8 ± 7 nM). In contrast, mismatches with the 5' end of the siRNA
had a dramatic effect on binding. A four nucleotide mismatch to the 5' end of the siRNA
increased the Ki ~12-fold (36.4 ± 9.2 nM) and an eight nucleotide mismatch to the 5'
end of the siRNA increased the Ki 53-fold (173 ± 16 nM). The differential effect on
binding between 5' and 3' mismatches was maintained even at the center of the siRNA:
a 2'-O-methyl oligonucleotide bearing four mismatches with siRNA nucleotides 11, 12,
13, and 14 (4 nt 3' central mismatch, Figure 17 B) bound more tightly to RISC (i.e., had
a lower Ki) than an oligonucleotide with four mismatches to siRNA positions 7, 8, 9,
and 10 (4 nt 5' central mismatch, Figure 17 B).
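
The fold changes quoted above follow from dividing each Ki by the Ki of the perfectly
matched 2'-O-methyl competitor (3.2 nM). The short sketch below repeats that arithmetic
using the rounded Ki values given in the text; the dictionary labels are illustrative
names, and ratios computed from rounded values differ slightly from the reported fold
changes.

```python
# Ki values (nM) as quoted in the text for 2'-O-methyl competitors.
reference_ki_nm = 3.2  # perfectly matched competitor
ki_nm = {
    "4 nt 3' mismatch": 9.0,
    "8 nt 3' mismatch": 34.8,
    "4 nt 5' mismatch": 36.4,
    "8 nt 5' mismatch": 173.0,
}

# Fold increase in Ki relative to the perfectly matched competitor.
fold_increase = {label: ki / reference_ki_nm for label, ki in ki_nm.items()}

for label, fold in fold_increase.items():
    print(f"{label}: Ki = {ki_nm[label]} nM, {fold:.1f}-fold over matched")
```

For the same number of mismatched nucleotides, the 5' mismatches give the larger ratio,
which is the asymmetry in binding that the text describes.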

Discussion of Examples VII-XI:

RISC programmed with exogenous siRNA is an enzyme, capable of multiple
rounds of target cleavage. Previous studies showed that cleavage of a target RNA by
RISC does not require ATP (Nykanen et al., 2001; Tomari et al., 2004). The more
detailed kinetic analysis presented herein indicates that there are no ATP-assisted steps
in either target recognition or cleavage by Drosophila RISC; no difference in rate in the
presence or absence of ATP was detected for RNAi reactions analyzed under conditions
of substrate excess at early time points (pre-steady state) or under conditions of enzyme
excess where the reaction was essentially single-turnover. In contrast, the steady-
state rate of cleavage under multiple-turnover conditions was enhanced four-fold by
ATP. The results indicate that release of the products of the RISC endonuclease is
rate-determining under these conditions in the absence of ATP, but not in the presence
of ATP. The most straightforward explanation for this finding is that an ATP-dependent
RNA helicase facilitates the dissociation of the products of target cleavage from the
RISC-bound siRNA. The involvement of such an ATP-dependent helicase in RNAi in
vivo may explain why siRNAs can be active within a broad range of GC content
(Reynolds et al., 2004).

In the presence of ATP, siRNA-programmed Drosophila RISC is a classical
Michaelis-Menten enzyme. The guide strand of the siRNA studied here has the sequence
of let-7, an endogenous miRNA. In vivo, let-7 is not thought to direct mRNA cleavage,
but rather is believed to repress productive translation of its mRNA targets. Nonetheless,
the let-7 siRNA is among the most potent of the siRNAs we have studied in vitro and
provides a good model for effective siRNA in general. With a kcat of ~7 x 10^-3 s^-1, the
let-7 siRNA-programmed RISC was slow compared to enzymes with small molecule
substrates (Table 1). The KM for this RISC was ~8 nM. Enzymes typically have KM
values between 1- and 100-fold greater than the physiological concentrations of their
substrates (Stryer et al., 1981). The results indicate that RISC is no exception: individual
abundant mRNA species are present in eukaryotic cells at high pM or low nM
concentration. The KM of RISC is likely determined primarily by the strength of its
interaction with the target RNA, because the KM is nearly identical to the Ki of a non-
cleavable 2'-O-methyl oligonucleotide inhibitor.

Recently, a study of the kinetic parameters of target RNA cleavage by human
RISC was described (Martinez et al., 2004). In that study, the minimal active human
RISC was highly purified; in this study, Drosophila RISC activity was measured for the
unpurified, intact holo-RISC, believed to be an 80S multi-protein complex (Pham et al.,
2004). Different siRNAs were used in the two studies. Nonetheless, the KM and kcat
values reported here and for the minimal human RISC are remarkably similar: the KM
was 2.7-8.4 nM and the kcat was 7.1 x 10^-3 sec^-1 for the let-7 siRNA-programmed
Drosophila holo-RISC versus a KM of 1.1-2.3 nM and a kcat of 1.7 x 10^-2 sec^-1 for a
different siRNA in minimal human RISC. As in this study, a pre-steady-state burst was
observed in the absence of ATP, consistent with the idea that product release is ATP-
assisted in vivo.

The ratio of kcat to KM is a classical measure of enzyme efficiency and
corresponds to the second order rate constant for the reaction when the concentration of
substrate is much less than the KM. For the let-7 programmed RISC, kcat KM^-1 equals
~8.4 x 10^5 M^-1 s^-1 (~8.4 x 10^-4 nM^-1 s^-1), a value far slower than the expected
rate of collision of RISC with mRNA, on the order of 10^7 M^-1 s^-1. It is possible that
the rate of catalysis by RISC is constrained by the rate of conformational changes
required for formation of the enzyme-substrate complex or by subsequent conformational
rearrangements required for catalysis. It is possible that siRNAs can be designed that
significantly improve either the kcat or KM of RISC without compromising specificity.
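
The efficiency figure above is simple arithmetic on the values reported for the let-7
programmed RISC (kcat = 7.1 x 10^-3 s^-1, KM = 8.4 nM). The sketch below reproduces
it, including the unit conversion from nM^-1 s^-1 to M^-1 s^-1; the function name is
illustrative.

```python
def catalytic_efficiency(kcat_per_s, km_nm):
    """Return kcat/KM in nM^-1 s^-1 and in M^-1 s^-1."""
    per_nm = kcat_per_s / km_nm
    per_molar = per_nm * 1e9  # 1 M = 1e9 nM
    return per_nm, per_molar


# Values quoted in the text for let-7 siRNA-programmed Drosophila RISC.
eff_per_nm, eff_per_molar = catalytic_efficiency(7.1e-3, 8.4)
print(f"kcat/KM = {eff_per_nm:.2e} nM^-1 s^-1 = {eff_per_molar:.2e} M^-1 s^-1")
```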

Although siRNAs are typically envisioned to bind their target RNAs through 19
to 21 complementary base pairs, we find that the 5', central, and 3' regions of the siRNA
make distinct contributions to binding and catalysis (Figure 18). Measurements of KM
and Ki suggest that the 5' nucleotides of the siRNA contribute more to target binding
than do the 3' nucleotides. At least for the siRNA examined here, the first three and the
last five nucleotides of a 21 nt siRNA contribute little to binding. If the KD of RISC
bound to its target RNA is essentially its KM, ~8 nM, then the free energy (ΔG° = -RT
ln KD) of the let-7-programmed RISC:target interaction is approximately -11 kcal
mol^-1, considerably less than the -35 kcal mol^-1 (KD ~10^-29) predicted [32] for the
let-7 RNA bound to a fully complementary RNA in 100 mM K+ and 1.2 mM Mg2+ at 25°C.
It is possible that RISC discards potential binding energy; by binding less tightly to its
target, an siRNA in RISC gains the ability to discriminate between well matched and
poorly matched targets, but only for bases in the 5' region of the siRNA guide strand.
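
The free-energy estimate for the RISC:target interaction can be checked numerically. The
sketch below assumes the standard relation for a dissociation constant, ΔG° = RT ln KD
(equivalently -RT ln KA with KA = 1/KD), which gives a negative value for favorable
binding. The function name and the choice of gas-constant units are illustrative.

```python
import math

R_KCAL_PER_MOL_K = 1.987204e-3  # gas constant in kcal mol^-1 K^-1


def binding_free_energy(kd_molar, temp_celsius=25.0):
    """Standard binding free energy (kcal/mol) from a dissociation constant in M."""
    temp_kelvin = temp_celsius + 273.15
    return R_KCAL_PER_MOL_K * temp_kelvin * math.log(kd_molar)


# KD taken as essentially equal to KM, ~8 nM, as stated in the text.
dg_risc = binding_free_energy(8e-9)
print(f"RISC:target binding free energy: {dg_risc:.1f} kcal/mol")
```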

Mismatches between the central and 3' regions of an siRNA and its target RNA
reduce kcat far more than mismatches at the 5' end of the siRNA. These results fit well
with recent findings by Doench and Sharp that translational repression by siRNA,
designed to act like animal miRNA, is dramatically disrupted by mismatches with the 5'
end of the siRNA, but not with similar mismatches at the 3' end [18]. These authors
propose that miRNA binding is mediated primarily by nucleotides at the 5' end of the
small RNA. In fact, complementarity between the 5' end of miRNAs and their targets
has been required by all computational approaches for predicting animal miRNA targets
(Rajewsky et al., 2004; Lewis et al., 2003; Stark et al., 2003; Enright et al., 2003). The
instant discovery that central and 3' siRNA sequences must pair with the target sequence
for effective target cleavage but not for target binding reinforces this view; both central
and 3' miRNA sequences are usually mismatched with their binding sites in their natural
targets (Lee et al., 1993; Reinhart et al., 2000; Brennecke et al., 2003; Abrahante et al.,
2003; Vella et al., 2004; Xu et al., 2003; Johnston et al., 2003).

Formation of a contiguous A-form helix surrounding the scissile phosphate of the
target mRNA has been proposed to be a quality control step for RISC-mediated target
cleavage (Chiu et al., 2003). The instant invention discovers that RISC can direct
cleavage when the siRNA is paired with the target RNA only at nts 2-12 of the guide
strand, corresponding to one complete turn of an RNA:RNA helix. This region of the
siRNA includes nts 2-8, which appear to be critical for miRNA recognition of mRNAs
targeted for translational repression, plus two nts flanking either side of the scissile
phosphate. The further discovery that unpairing the first nt of the guide
strand enhances the activity of siRNAs with seven, eight or nine 3' mismatches to the
RNA target is striking, since many miRNAs do not pair with their targets at this
position. Furthermore, such pairing resembles that reported by Linsley and colleagues
for siRNA-directed off-target effects in cultured mammalian cells (Jackson et al., 2003).

The requirement for a full turn of a helix may reflect a mechanism of 'quality
control' by RISC. Since RISC can apparently assemble on any siRNA sequence, it must
use the structure of the siRNA paired to its target to determine whether or not to cleave.
Despite the apparent surveillance of the structure of the siRNA/target pair, the identity
of the scissile phosphate is unaltered by extensive mismatch between the 5' end of the
siRNA and its target. Yet the scissile phosphate is determined by its distance from the 5'
end of the siRNA guide strand (Elbashir et al., 2001; Elbashir et al., 2001). The simplest
explanation for the instant discovery is that the scissile phosphate is identified by a
protein loaded onto the siRNA during RISC assembly, i.e., before the encounter of the
RISC with its target RNA.

The remarkable tolerance of RISC for mismatches between the siRNA and its
targets (up to nine contiguous 3' nucleotides) implies that a large number of off-target
genes should be expected for many siRNA sequences when RISC is present in excess
over its RNA targets. However, RISC with extensive mismatches between the siRNA
and target is quite slow to cleave, so off-target effects may be minimized by keeping
the amount of RISC as low as possible. These understandings of the molecular basis of
siRNA-directed gene silencing assist the skilled artisan in creating siRNAs designed to
balance the competing demands of siRNA efficacy and specificity.

Experimental Procedures 

A. General methods 

Drosophila embryo lysate preparation, in vitro RNAi reactions, and cap-labeling
of target RNAs using guanylyl transferase were carried out as previously described
(Tuschl et al., 1999; Zamore et al., 2000). Target RNAs were used at ~5 nM
concentration to ensure that reactions occurred under single-turnover conditions. Target
cleavage under these conditions was proportionate to siRNA concentrations. Cleavage
products of RNAi reactions were analyzed by electrophoresis on 5% or 8% denaturing
acrylamide gels. 5' end labeling and determination of siRNA unwinding status were
according to Nykanen et al. (Nykanen et al., 2001) except that unlabeled competitor
RNA was used at 100-fold molar excess. Gels were dried, then exposed to image plates
(Fuji), which were scanned with a Fuji FLA-5000 phosphorimager. Images were
analyzed using Image Reader FLA-5000 version 1.0 (Fuji) and Image Gauge version
3.45 or 4.1 (Fuji). Data analysis was performed using Excel (Microsoft) and IgorPro 5.0
(Wavemetrics).

B. Drosophila embryo lysate, siRNA labeling with polynucleotide kinase (New
England Biolabs), target RNA preparation and labeling with guanylyl transferase were
carried out as described (Hutvagner et al., 2002; Haley et al., 2003). The forward
primer sequence for the 379 nt target mRNA was 5'-CGC TAA TAG GAC TCA CTA TAG
€AG TTG GCG CCG CGA ACG A-3', and 5'-GCG TAA TAG GAC TCA CTA TAG
TCA CAT CTC ATC TAG CTG C-3' for the 182 nt target. Reverse primers used to
generate fully matched and mismatched target RNAs were: 5'-CCC ATT TAG GTG
ACA CTA TAG ATT TAG ATC GCG TTG AGT GTA GAA CGG TTG TAT AAA 
AGG TTG AGG TAG TAG GTT GTA TAG TGA AGA GAG GAG TTC ATG ATC 
AGT G-3' (perfect match to let-7); 5'-CCC ATT TAG GTG ACA CTA TAG ATT TAG
ATC GCG TTG AGT GTA GAA CGG TTG TAT AAA AGG TTG AGG TAG TAG 
GTT CAT GCA GGA AGA GAG GAG TTC ATG ATC AGT G-3'(7 nt 3 ' mismatch); 
5'-CCC ATT TAG GTG ACA CTA TAG ATT TAG ATC GCG TTG AGT GTA GAA 
CGG TTG TAT AAA AGG TTG AGG TAG TAG GTA GAU GCA GGA AGA GAG
GAG TTC ATG ATC AGT G-3' (8 nt 3' mismatch); 5'-CCC ATT TAG GTG ACA
CTA TAG ATT TAG ATC GCG TTG AGT GTA GAA CGG TTG TAT AAA AGG 
TTG AGG TAG TAG GAA CAT GCA GGA AGA GAG GAG TTC ATG ATC AGT 
G-3' (9 nt 3 ' mismatch); 5'-CGC ATT TAG GTG ACA CTA TAG ATT TAG ATG 
GCG TTG AGT GTA GAA CGG TTG TAT AAA AGG TAG TCC ATG TAG GTT
GTA TAG TGA AGA GAG GAG TTC ATG ATG AGT G-3' (8 nt 5' mismatch); 5'-
CGG ATT TAG GTG ACA CTA TAG ATT TAG ATC GCG TTG AGT GTA GAA 
CGG TTG TAT AAA AGG TAG TCG TAG TAG GTT GTA TAG TGA AGA GAG 
GAG TTC ATG ATC AGT G-3' (4 nt 5' mismatch). In Figures 13, 14, 15, 17A, 19 and
20A, the target sequence was 613 nt long; it was 379 nt in Figures 16A-C, 17B and 20B,
and 182 nt in Figure 16D. All siRNAs were deprotected according to the manufacturer's
protocol (Dharmacon), 5'-radiolabeled where appropriate, then gel purified on a 15%
denaturing polyacrylamide gel. 2'-O-methyl oligonucleotides were from Dharmacon.
siRNA strands were annealed at high concentrations and serially diluted into lysis buffer
(30 mM HEPES pH 7.4, 100 mM KOAc, and 2 mM MgCl2). Gels were dried and
imaged as described (Schwartz et al., 2003). Images were analyzed using Image Gauge
4.1 (Fuji). Initial rates were determined by linear regression using Excel X (Microsoft)
or IgorPro 5.01 (Wavemetrics). Kaleidagraph 3.6.2 (Synergy Software) was used to
determine KM and Ki by global fitting to the equations: V = (Vmax x S)(KM + S)^-1 and
V = (Vmax x Ki(app))(Ki(app) + I)^-1, where V is velocity, S is target RNA
concentration, and I is the concentration of 2'-O-methyl oligonucleotide competitor. Ki
was calculated by correcting Ki(app) by the KM and substrate concentration, Ki =
Ki(app)(1 + (S KM^-1))^-1.
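
The inhibition equation and the correction from apparent to true Ki given above can be
written out as a short sketch. The function names and the numerical inputs are
hypothetical and serve only to show the arithmetic; they are not values from the
experiments.

```python
def velocity_with_competitor(vmax_app, ki_app, inhibitor):
    """V = (Vmax x Ki(app)) / (Ki(app) + I), as in the fitting equation above."""
    return vmax_app * ki_app / (ki_app + inhibitor)


def corrected_ki(ki_app, substrate, km):
    """Ki = Ki(app) / (1 + S/KM), the correction for a competitive inhibitor."""
    return ki_app / (1.0 + substrate / km)


# Hypothetical inputs (nM): apparent Ki from the fit, substrate concentration, and KM.
example_ki = corrected_ki(ki_app=4.8, substrate=4.2, km=8.4)
print(f"corrected Ki: {example_ki:.2f} nM")

# At an inhibitor concentration equal to Ki(app), the velocity is half the
# uninhibited value under the same substrate concentration.
half_velocity = velocity_with_competitor(vmax_app=1.0, ki_app=4.8, inhibitor=4.8)
print(f"relative velocity at I = Ki(app): {half_velocity:.2f}")
```

The correction matters most when the substrate concentration approaches or exceeds KM;
when S is far below KM, Ki(app) and Ki are nearly equal.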


C. siRNA preparation 

Synthetic RNAs (Dharmacon) were deprotected according to the manufacturer's
protocol. siRNA strands were annealed (Elbashir et al., 2001a) and used at 50 nM final
concentration unless otherwise noted. siRNA single strands were phosphorylated with
polynucleotide kinase (New England Biolabs) and 1 mM ATP according to the
manufacturer's directions and used at 500 nM final concentration.

D. Target RNA preparation 

Target RNAs were transcribed with recombinant, histidine-tagged T7 RNA
polymerase from PCR products as described (Nykanen et al., 2001; Hutvagner and
Zamore, 2002), except for sense sod1 mRNA, which was transcribed from a plasmid
template (Crow et al., 1997) linearized with Bam HI. PCR templates for htt sense and
anti-sense and sod1 anti-sense target RNAs were generated by amplifying 0.1 ng/ml
(final concentration) plasmid template encoding htt or sod1 cDNA using the following
primer pairs: htt sense target, 5'-GGG TAA TAG GAG TGA GTA TAG GAA GAG
TAT GTG TGA GAG ATG-3' and 5'-UUGG AAG UAU UGG GGG UAC GU-3'; htt
anti-sense target, 5'-GCG TAA TAG GAG TGA GTA TAG GAG AAG CGT AAT TAG
TGATGG-3' and 5'-GAA GAG TAT GTG TGA GAG ATG-3'; sod1 anti-sense target,
5'-GGG TAA TAG GAG TGA GTA TAG GGG TTT GTT AGG AGC GGG AT-3' and
5'-GGG AGA GGA GAA GGG TTT GGG-3'.

Immobilized 2'-O-methyl oligonucleotide capture of RISC

The 5' end of the siRNA strand to be measured was 32P-radiolabeled with PNK.
10 pmol biotinylated 2'-O-methyl RNA was immobilized on Dynabeads M280 (Dynal)
by incubation in 10 µl lysis buffer containing 2 mM DTT for 1 h on ice with the
equivalent of 50 µl of the suspension of beads provided by the manufacturer. The beads
were then washed to remove unbound oligonucleotide. 50 nM siRNA was pre-incubated
in a standard 50 µl in vitro RNAi reaction for 15 min at 25°C. Then, all of the
immobilized 2'-O-methyl oligonucleotide was added to the reaction and the incubation
continued for 1 h at 25°C. After incubation, the beads were rapidly washed three times
with lysis buffer containing 0.1% (w/v) NP-40 and 2 mM DTT followed by a wash with
the same buffer without NP-40. Input and bound radioactivity were determined by
scintillation counting (Beckman). The 5'-biotin moiety was linked via a six-carbon
spacer arm. 2'-O-methyl oligonucleotides (IDT) were: 5'-biotin-ACA UUU CGA AGU
AUU CCG CGU ACG UGA UGU U-3' (to capture the siRNA sense strand) and 5'-biotin-
CAU CAC GUA CGC GGA AUA CUU CGA AAU GUC C-3' (to capture the anti-
sense strand).

mfold Analysis 

To model the end of an siRNA, the following 16 nt RNA sequences were
submitted to mfold 3.1 (37°C, 1 M NaCl): CGUACUUUUGUACGUG, UGU ACU
UUU GUA CGU G, and UCG AAU UU UUC GAA A.

Pre-let-7 Processing

Pre-let-7 RNA was incubated with N-terminal histidine-tagged human Dicer
according to the manufacturer's directions (Gene Therapy Systems) or in a standard
Drosophila embryo in vitro RNAi reaction as described previously (Hutvagner et al.,
2001; Hutvagner and Zamore, 2002).

Northern Hybridization 

Northern hybridization was essentially as described (Hutvagner et al., 2001). 50
µg total RNA was loaded per lane. 5' 32P-radiolabeled synthetic RNA probes
(Dharmacon) were: 5'-ACA AAU UCG GAU GUA CAG GGU-3' (to detect miR-10)
and 5'-AAA CCU CUC UAG AAC CGA AUU U-3' (to detect miR-10*). The amount
of miR-10 or miR-10* detected was normalized to the non-specific hybridization of the
probe to 5S rRNA. Normalizing to hybridization of the probe to a known amount of a
miR-10 or miR-10* synthetic RNA control yielded essentially the same result.

ATP-depletion and N-ethyl maleimide (NEM) Inhibition

RNAi reactions using Drosophila embryo lysate were as described (Haley et al.,
2003). To compare 'minus' and 'plus' ATP conditions, samples were treated with 10
mM NEM (Pierce) for 10 min at 4°C, then the NEM was quenched with 11 mM
dithiothreitol (DTT). For ATP depletion (-ATP), 1 unit of hexokinase and 20 mM (final
concentration) glucose were added and the incubation continued for 30 min at 25°C. For
'plus' ATP reactions, 0.05 mg ml^-1 (final concentration) creatine kinase and one-tenth
volume H2O substituted for hexokinase and glucose. The addition of fresh creatine
kinase after NEM treatment did not rescue the defect in RISC assembly, but did restore
ATP to high levels (Nykanen et al., 2001). ATP levels were measured using an ATP
assay kit (Sigma) and a PhL luminometer (Mediators Diagnostika).

Equivalents

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims.

What is claimed:

1. A method of enhancing the ability of a first strand of an RNAi agent to act
as a guide strand in mediating RNAi, comprising lessening the base pair strength
between the 5' end of the first strand and the 3' end of a second strand of the duplex as
compared to the base pair strength between the 3' end of the first strand and the 5' end
of the second strand.

2. A method of enhancing the efficacy of an siRNA duplex, the siRNA
duplex comprising a sense and an antisense strand, comprising lessening the base pair
strength between the antisense strand 5' end (AS 5') and the sense strand 3' end (S 3') as
compared to the base pair strength between the antisense strand 3' end (AS 3') and the
sense strand 5' end (S 5'), such that efficacy is enhanced.

3. A method of promoting entry of a desired strand of an siRNA duplex into
a RISC complex, comprising enhancing the asymmetry of the siRNA duplex, such that
entry of the desired strand is promoted.

4. The method of claim 3, wherein asymmetry is enhanced by lessening the
base pair strength between the 5' end of the desired strand and the 3' end of a
complementary strand of the duplex as compared to the base pair strength between the
3' end of the desired strand and the 5' end of the complementary strand.

5. The method of claim 1 or 2, wherein the base-pair strength is less due to 
fewer G:C base pairs between the 5' end of the first or antisense strand and the 3' end of 
the second or sense strand than between the 3' end of the first or antisense strand and the 
5' end of the second or sense strand. 

6. The method of claim 1 or 2, wherein the base pair strength is less due to
at least one mismatched base pair between the 5' end of the first or antisense strand and
the 3' end of the second or sense strand. 

7. The method of claim 6, wherein the mismatched base pair is selected 
from the group consisting of G:A, C:A, C:U, G:G, A:A, C:C and U:U.

8. The method of claim 6, wherein the mismatched base pair is selected
from the group consisting of G:A, C:A, C:T, G:G, A:A, C:C and U:T. 

9. The method of claim 1 or 2, wherein the base pair strength is less due to 
at least one wobble base pair between the 5' end of the first or antisense strand and the 
3' end of the second or sense strand. 

10. The method of claim 9, wherein the wobble base pair is G:U. 

11. The method of claim 9, wherein the wobble base pair is G:T. 



12. The method of claim 1 or 2, wherein the base pair strength is less due to: 

(a) at least one mismatched base pair between the 5' end of the first or
antisense strand and the 3' end of the second or sense strand; and

(b) at least one wobble base pair between the 5' end of the first or antisense
strand and the 3' end of the second or sense strand.



13. The method of claim 12, wherein the mismatched base pair is selected 
from the group consisting of G:A, C:A, C:U, G:G, A:A, C:C and U:U.

14. The method of claim 12, wherein the mismatched base pair is selected 
from the group consisting of G:A, C:A, C:T, G:G, A:A, C:C and U:T.

15. The method of claim 12, wherein the wobble base pair is G:U.

16. The method of claim 12, wherein the wobble base pair is G:T. 

17. The method of claim 1 or 2, wherein the base pair strength is less due to
at least one base pair comprising a rare nucleotide.

18. The method of claim 12, wherein the rare nucleotide is inosine (I). 

19. The method of claim 18, wherein the base pair is selected from the group
consisting of an I:A, I:U and I:C.



20. The method of claim 1 or 2, wherein the base pair strength is less due to 
at least one base pair comprising a modified nucleotide. 

WO 2005/001043 PCT/US2004/017256

21. The method of claim 20, wherein the modified nucleotide is selected
from the group consisting of 2-amino-G, 2-amino-A, 2,6-diamino-G, and 2,6-diamino-A.

22. The method of claim 1, wherein the RNAi agent is a siRNA duplex.

23. The method of claim 1 or 2, wherein the RNAi agent or siRNA duplex is 
chemically synthesized. 

24. The method of claim 1 or 2, wherein the RNAi agent or siRNA duplex is 
enzymatically synthesized. 

25. The method of claim 1 or 2, wherein the RNAi agent or siRNA duplex is 
derived from an engineered precursor.

26. A method of enhancing silencing of a target mRNA, comprising 
contacting a cell having an RNAi pathway with the RNAi agent or siRNA duplex of any 
one of the preceding claims under conditions such that silencing is enhanced. 

27. A method of enhancing silencing of a target mRNA in a subject,
comprising administering to the subject a pharmaceutical composition comprising the 
RNAi agent or siRNA duplex of any one of the preceding claims such that silencing is 
enhanced. 

28. A method of decreasing silencing of an inadvertent target mRNA by a
dsRNAi agent, the dsRNAi agent comprising a sense strand and an antisense strand,
comprising:

(a) detecting a significant degree of complementarity between the sense
strand and the inadvertent target; and

(b) enhancing the base pair strength between the 5' end of the sense strand
and the 3' end of the antisense strand relative to the base pair strength between
the 3' end of the sense strand and the 5' end of the antisense strand;

such that silencing of the inadvertent target mRNA is decreased.

29. The method of claim 28, wherein silencing of the inadvertent target
mRNA is decreased relative to silencing of a desired target mRNA.

30. An siRNA duplex comprising a sense strand and an antisense strand,
wherein the base pair strength between the antisense strand 5' end (AS 5') and the sense
strand 3' end (S 3') is less than the base pair strength between the antisense strand 3' end
(AS 3') and the sense strand 5' end (S 5'), such that the antisense strand preferentially
guides cleavage of a target mRNA.

31. The siRNA duplex of claim 1, wherein the base-pair strength is less due
to fewer G:C base pairs between the AS 5' and the S 3' than between the AS 3' and the
S 5'.

32. The siRNA duplex of claim 30, wherein the base pair strength is less due
to at least one mismatched base pair between the AS 5' and the S 3'.

33. The siRNA duplex of claim 32, wherein the mismatched base pair is 
selected from the group consisting of G:A, C:A, C:U, G:G, A:A, C:C and U:U.

34. The siRNA duplex of claim 32, wherein the mismatched base pair is
selected from the group consisting of G:A, C:A, C:T, G:G, A:A, C:C and U:T.

35. The siRNA duplex of claim 30, wherein the base pair strength is less due
to at least one wobble base pair between the AS 5' and the S 3'.

36. The siRNA duplex of claim 32, wherein the wobble base pair is G:U. 

37. The siRNA duplex of claim 32, wherein the wobble base pair is G:T. 

38. The siRNA duplex of claim 30, wherein the base pair strength is less due 
to at least one base pair comprising a rare nucleotide. 

39. The siRNA duplex of claim 38, wherein the rare nucleotide is inosine (I).

40. The siRNA duplex of claim 39, wherein the base pair is selected from the
group consisting of an I:A, I:U and I:C.

41. The siRNA duplex of claim 30, wherein the base pair strength is less due
to at least one base pair comprising a modified nucleotide.

42. The siRNA duplex of claim 41, wherein the modified nucleotide is
selected from the group consisting of 2-amino-G, 2-amino-A, 2,6-diamino-G, and
2,6-diamino-A.

43. A composition comprising the RNAi agent or siRNA duplex of any one
of the preceding claims, formulated to facilitate entry of the RNAi agent or siRNA
duplex into a cell.

44. A pharmaceutical composition comprising the RNAi agent or siRNA 
duplex of any one of the preceding claims. 

45. An engineered pre-miRNA comprising the RNAi agent or siRNA duplex 
of any one of the preceding claims. 

46. A vector encoding the pre-miRNA of claim 45. 

47. A pri-miRNA comprising the pre-miRNA of claim 46. 

48. A vector encoding the pri-miRNA of claim 47.

49. A small hairpin RNA (shRNA) comprising nucleotide sequence identical
to the sense and antisense strand of the siRNA duplex of any one of the preceding
claims.

50. The shRNA of claim 49, wherein the nucleotide sequence identical to the
sense strand is upstream of the nucleotide sequence identical to the antisense strand.

51. The shRNA of claim 49, wherein the nucleotide sequence identical to the
antisense strand is upstream of the nucleotide sequence identical to the sense strand.

52. A vector encoding the shRNA of any one of claims 49-51.

53. A cell comprising the vector of any one of claims 46, 48 or 52.

54. The cell of claim 53, which is a mammalian cell.

55. The cell of claim 53, which is a human cell.

56. A transgene encoding the shRNA of any one of claims 49-51.
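
By way of informal illustration only, and not as part of the claims, the end asymmetry recited in claims 5 and 31 (fewer G:C base pairs at the AS 5'/S 3' end than at the AS 3'/S 5' end) can be sketched in Python. The function names, the four-base-pair window and the example sequences are hypothetical choices made for this sketch. Counting G:C pairs is only a crude proxy for base pair strength, and the sketch assumes a blunt, fully aligned duplex with both strands written 5' to 3' (overhangs omitted).

```python
# Illustrative sketch only: names, window size and sequences are assumptions,
# not taken from the specification.

GC_PAIRS = {("G", "C"), ("C", "G")}


def end_gc_counts(antisense: str, sense: str, window: int = 4) -> tuple[int, int]:
    """Count G:C pairs at each end of a blunt duplex.

    Both strands are given 5'->3' and must have equal length, so that
    antisense position i (from its 5' end) faces sense position n-1-i.
    Returns (pairs at AS 5'/S 3' end, pairs at AS 3'/S 5' end).
    """
    antisense = antisense.upper()
    sense = sense.upper()
    n = len(antisense)
    if n != len(sense):
        raise ValueError("strands must have equal length (blunt duplex assumed)")
    if not 0 < window <= n:
        raise ValueError("window must be between 1 and the duplex length")

    # Pair each antisense base with the sense base it faces in the duplex.
    pairs = [(antisense[i], sense[n - 1 - i]) for i in range(n)]

    as5_end = sum(pair in GC_PAIRS for pair in pairs[:window])
    as3_end = sum(pair in GC_PAIRS for pair in pairs[n - window:])
    return as5_end, as3_end


def favors_antisense(antisense: str, sense: str, window: int = 4) -> bool:
    """True when the AS 5' end has fewer G:C pairs than the AS 3' end."""
    as5_end, as3_end = end_gc_counts(antisense, sense, window)
    return as5_end < as3_end


# Hypothetical 12-bp duplex: A/U-rich at the antisense 5' end, G/C-rich at its 3' end.
example_antisense = "AUUAGCAUGGCC"
example_sense = "GGCCAUGCUAAU"  # reverse complement of example_antisense
print(end_gc_counts(example_antisense, example_sense))     # (0, 4)
print(favors_antisense(example_antisense, example_sense))  # True
```

Swapping the two arguments scores the same duplex from the other strand's point of view, which for this hypothetical duplex reverses the result.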

FIGURE 2
FIGURE 3
FIGURE 6
FIGURE 7
FIGURE 10
FIGURE 11
FIGURE 12
FIGURE 14

[Sequence diagram; sequences not recoverable from the source. Panels: target with perfectly matched siRNA; 1 nt 3' mismatched siRNA; 2 nt 3' mismatched siRNA; 3 nt 3' mismatched siRNA; 4 nt 3' mismatched siRNA.]

FIGURE 16
FIGURE 19
FIGURE 20
FIGURE 21



[Table; sequence alignments not recoverable from the source. Columns: Target and siRNA; Figure #. Rows: target paired with fully matched siRNA and with 1, 2, 3, 4, 5, 6, 8 and 10 nt 3' mismatched siRNAs. Figure references: Figures 2b-d, 3a,b and 5a; Figure 3c.]

[Table; sequence alignments not recoverable from the source. Rows: target paired with 1, 3, 5, 6, 7 and 8 nt 5' mismatched siRNAs; 8 nt mismatched targets paired with siRNA; 8 nt 5' mismatched target paired with 8 nt 5' mismatched siRNA. Figure references: Figures 4a, d and 5a; Figure 4b.]

[Table; sequence alignments not recoverable from the source. Rows: siRNA and 1, 2, 3 and 4 nt 5' mismatched siRNAs, each paired with 7, 8 and 9 nt 3' mismatched targets.]

[Table; sequence alignments not recoverable from the source. Rows: 2'-O-methyl targets paired with siRNA: perfectly matched; 4 nt mismatched (terminus illegible); 8 nt 5' mismatched; 4 nt 5' central mismatched; 4 nt 3' central mismatched; 4, 7 and 6 nt 3' mismatched. Figure reference: Figure 5c.]